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ENCODED REACTION CASSETTE 



T^hnVr^l yield 

The invention relates to methods and reagents 
for assaying cleavage and ligation reactions and the 
activity of catalytic molecules which promote 
cleavage and ligation reactions. More particularly, 
the invention relates to assays which employ 
substrates covalently linked both to a solid phase 
matrix and to a nucleotide strand encoded to identify 
the substrate and including polymerization chain 
reaction (PCR) primer sequences for amplifying such 
encoded sequences. 

si-nl-ftTnan*- of Sflvprnmwt. Eights; 

This invention was made, in part, with 
government support under Grant No.'s GM 48351 from 
the National Institutes of Health. The U.S. 
government may have certain rights in the invention. 

r 

fiarVgrou nri of thp Tnventiont 

Cleavage and ligation reactions can be assayed 
by monitoring either the appearance of reaction 
products or the disappearance of substrates. For 
example, protein or peptide proteolysis can be 
monitored by following the appearance of cleavage 
products. Cleavage products may be separated from 
substrate by gel electrophoresis or by 
chromatographic separation and monitored by UV 
absorption or colorimetric assay. Similarly, 
polynucleotide ligation can be monitored by following 
the appearance of ligation products. 




If the cleavage or ligation activity is low, the 
detection signal may require amplification to detect 
the reaction products. For example, radio labeled 
substrates may be synthesized and the resulting 
reaction products may be detected by radio 
immunoassay. Alternatively, if the substrate is 
conjugated to an enzyme which catalyzes a 
color imetric reaction, the resultant reaction 
products may be detected by means of an enzyme 
immunoassay. Highly sensitive enzyme immunoassays 
have been developed. When employed at their limit of 
sensitivity, the signal produced by an enzyme 
immunoassay in response to the presence of reaction 
product becomes comparable to the background signal. 

In the catalytic antibody field, antibody 
libraries are routinely screened in order to identify 
catalytically active antibody. There are two 
limitations to creating a useful catalytic antibody, 
viz.: l.) designing a productive immunogen which is 
an analog of the substrate or reaction intermediate 
and producing an antibody library therewith; and 2.) 
screening the resultant antibody library for the 
desired catalytic activity. In short, the 
limitations are display and detection. The problem 
of display can be facilitated by converting the 
antibody diversity into a combinatorial library in 
phage where the recognition and replication functions 
are linked in a single entity and monitored by simple 
binding events. However, the screening of phage 
particles which display only a small number of 
catalytic antibody molecules requires a highly 
sensitive assay methodology. In such instances, the 
displayed catalytic activity may be only slightly 
higher than background activity. Prior art methods 



for assaying cleavage and ligation reactions 
sometimes lack the requisite sensitivity for 
identifying small quantities of low activity 
antibody, in some instances , catalytic antibody 
having a low level of catalytic activity can be 
useful if it is the only antibody identified to have 
such catalytic activity and/or if it is employed in 
an "evolutionary scheme" for generating antibodies 
having higher levels of catalytic activity. 
Accordingly, the sensitivity of the assay employed 
for screening an antibody library, may be the 
limiting factor with respect to the identification of 
useful antibody. The ease or difficulty of the assay 
may also limit the willingness of workers to perform 
these assays. 

In instances in which one can obtain antibodies 
to the reaction product or substrate, an immuno-PCR 
assay may be constructed and employed as a detection 
system. (T. Sano et al., Science (1992): vol. 258, 
pages 120- 122) . An immuno-PCR assay is similar to 
an enzyme immunoassay except that the enzyme-antibody 
conjugate is replaced by an antibody conjugated to a 
PCR amplifiable polynucleotide strand. Immuno-PCR 
assays are highly sensitive. However, at very. low 
levels of antigen, the immuno-PCR assay is limited by 
non-specific binding of the antibody-polynucleotide 
conjugate. 

What is needed is a highly sensitive assay for 
detecting cleavage or ligation reactions. The assay 
should not employ an antibody conjugate and should 
have the lowest possible background signal. The 
assay should be labor efficient and should be 
adaptable for assaying any cleavage or ligation 




reaction. 



Summary of the Invention: 

The invention is directed to an encoded reaction 
cassette employable for detecting cleavage or 
ligation reactions and to assays employing such 
cassettes. Cassettes may be constructed for assaying 
any cleavage or ligation reaction. The cleavage or 
ligation reaction may be catalyzed or spontaneous, 
i.e., without benefit of a catalytist. 



A reaction cassette is designed for the highly 
sensitive detection of the making and breaking of 
chemical bonds. The system may be employed as a 
companion device to be used in the search for 
antibody and other novel catalysts* The cassette may 
also have important clinical applications in the 
design of diagnostic reagents. In its fully encoded 
format this methodology is capable of both detecting 
and decoding chemical events. 



The assay of the present application was 
originally developed for application in the field of 
antibody catalysis. However, it may also be employed 
for assaying any synthetic or enzymatic reaction, 
including those important to diagnostic assays in 
medicine. When one is using the cassette to search 
for a single reaction, only one DNA sequence is 
necessary. However, in a fully encoded format, one 
can test multiple substrates simultaneously by using 
unique polynucleotide sequences for each substrate. 
The nature of the reaction that occurred may be 
simply determined by the sequence of the 




polynucleotide either after the PCR reaction or upon 
cloning of the polynucleotide according to the method 
of A. p. Mirzabekov {TISTECH (1994): vol. 12, pages 
27-32) . In essence, one can. create an encoded 

5 combinatorial library of substrates to learn about 

reaction specificities. One, can design systems in 
which a combinatorial library of catalysts is 
screened against a combinatorial library of 
substrates to find new catalysts and refine their 

10 substrate specificity in a, single operation. 

Finally, the disclosed method for constructing 
encoded cassettes may be adapted to an operating 
procedure in which a substrate cassette is built as a 
companion to any experiment where one is searching 

15 for a new catalyst. This enables the researcher or 

investigator to design experiments which are 
independent of whether the reaction products can be 
easily assayed by prior art methods. Thus,, each time 
one contemplates searching for an enzyme the first 

20 step is to construct an encoded reaction cassette to 

detect reactivity. 

More particularly, the invention is directed to 
an encoded reaction cassette for assaying a cleavage 

25 reaction. The reaction cassette includes a substrate 

covalently linked to a solid phase matrix, wherein 
the substrate is of a type which is suspectable to 
cleavage by means of the cleavage reaction. Linked 
to the substrate is a first polynucleotide which 

30 includes a first PCR primer sequence, an encoding 

~s¥quence~r and a second - PCR primer -sequence . The 
encoding sequence is positioned between the first and 
second PCR sequences. In a preferred embodiment, the 
encoded reaction cassette may also include a first 

35 and second linker. The first linker covalently links 




the solid phase matrix to the substrate . The second 
linker covalently links the substrate to the first 
polynucleotide. Peptides are preferred substrates. 
However, any cleavable substrate may be employed. In 
an alternative* embodiment the encoded reaction 
cassette may include one or more additional 
polynucleotides linked to the substrate for further 
amplifying the signal. These additional 
polynucleotides also include the same first PCR 
primer sequence , encoding sequence, and second PCR 
primer sequence and may be linked to the substrate 
via additional linkers. 



The invention is also directed to an admixture 

15 of cleavage products from an encoded reaction 

cassette which has been exposed to a cleavage agent. 
This admixture includes a solid phase cleavage 
product and a soluble phase cleavage product. The 
solid phase cleavage product includes a first 

20 cleavages-product- of~a~substrate covalently linked to 

the solid phase matrix. The soluble phase cleavage 
product includes a second cleavage product of the 
substrate covalently linked to the first 
polynucleotide. The invention is also directed to 

25 a method for detecting a cleavage agent within a 

sample. The method comprises the following steps. 
In the first step, the sample is admixed with an 
encoded reaction cassette under reaction conditions 
for promoting cleavage of the substrate to produce a 

30 cleavage products. If the sample has cleavage 

activity ," cleavage prdductsnmi^be" generated , i.e., 
a solid phase cleavage product and a soluble phase 
cleavage product. In the second step, soluble phase 
cleavage product is separated and isolated from the 

35 solid phase cleavage products and from uncleaved 
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encoded reaction cassettes. In the third step,. the 
encoding sequence of the polynucleotide of the 
• soluble phase cleavage product isolated in the second 

step is amplified by means of a polymerization chain 
: *0i 5 reaction (PCR) . In the fourth step /the amplified 

1 ' ' encoding sequence amplified is detected. And, in an 

alternative fifth step, the signal obtained in the 
fourth step is correlated with Known substrates 
and/or cleaving agents to obtain quantitative 
10 results . 



15 



20 



25 



30 



35 



Another aspect of the invention is directed to 
an admixture of unligated reactants for producing an 
encoded ligation cassette for assaying a ligation 
reaction. The admixture may include a solid phase 
ligation component and a soluble phase ligation 
component. The solid phase ligation component 
includes a first ligation reactant covalently linked 
to a solid phase matrix. The soluble phase ligation 
component includes a second ligation reactant 
covalently linked to a first polynucleotide. As 
before, the first polynucleotide includes an encoding 
sequence positioned between a first PCR primer 
sequence and a second PCR primer sequence. The first 
and second ligation reactants are capable of ligation 
in the presence of a ligating agent to join the solid 
phase and soluble phase ligation components so as to 
form an encoded ligation cassette. In a preferred 
embodiment, the first and second ligation reactants 
are fragments of a ligatible oligonucleotide. 
However, any pair of ligatible molecules may be 
employed. The encoded ligation cassette is analogous 
to the encoded reaction cassette except that a 
ligation product separates the solid phase matrix 
from the first polynucleotide. However, unlike the 



o 
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encoded reaction cassette, the encoded ligation 
cassette need not be susceptible to cleavage by a 
cleavage agent. Preferred ligating agents have 
ligation activity with respect to the first and 
5 second ligation reactarits. Moire particularly, a 

preferred ligation agent is polynucleotide ligase and 
preferred first and second ligation reactants are 
ligatible oligonucleotides. 

10 The invention is also directed to a method for 

detecting a nucleotide iigating agent within a 
oligonucleotide sample. The method includes several 
steps. In the first step, the sample is combined 
with an admixture of ligation components. The 

15 resultant admixture is the incubated for producing an 

encoded ligation cassette. In the second step, the 
encoded- ligation cassette formed above is then 
separated and isolated together with unligated 
portions of the solid phase ligation component from 

20 *the unligated portion of the soluble phase ligation 
component. The encoding sequence of the 
polynucleotide of the encoded ligation cassette may 
then be amplified by means of^PCR, detected, and 
correlated with the presence of the ligation agent. 

25 Polynucleotide ligase is a preferred ligation agent. 

In this instance, the ligation product included 
within the encoded ligation cassette is a 
polynucleotide susceptible to ligation by the ligase. 



The encoded reaction cassette operates through 
the liberation (bond cleavage event) or capture (bond 
formation event) of a polynucleotide that can be 
amplified and decoded. Chemical methods for the 
synthesis and assembly of this device are disclosed 
together with a detailed characterization and 



optimization of the parameters that influence its 
sensitivity and practicality. In the bond cleavage 
detection mode, using a-chymotrypsin as an exemplary 
catalyst, the specificity of the reaot ion cassette is 
demonstrated through the selective recognition and 
cleavage of peptide substrates differing in sequence 
by only one amino-acid (Ala 2 -Tyr-Ala 2 versus Ala 2 -Phe- 
Ala 2 ). The cassettes sensitivity (0.1-1 pmoles, 5-50 
nM) is the same whether using 0.01 mg (1 bead) or 10 
mg (10000 beads), but is dependent on the 
concentration of a-chymotrypsin. The amount of this 
enzyme, however, can be decreased to as little as 
2400 molecules provided the concentration is kept * 5 
nM. In the bond formation detection mode, a- 
chymotrypsin catalyzed peptide bond formation was not 
possible because of hindrance , reasons , whereas 
chemically catalyzed bond formation (reductive 
amination of an aldehyde) could be performed, and was 
used as the prototype reaction for the demonstration 
of the second part of the principle of the reaction 
cassette, namely, the detection of the formation of a 
chemical bond. 

ftr^f nesciHp^on of the Figures: 

Figure 1 illustrates the principle of the 
reaction cassette. 

Figure 2 illustrates the spacers which are used 
for the assembly of the reaction cassettes. 

Figure 3 illustrates the synthetic scheme for 
the preparation of spacers III-VII. The indicated 
synthetic steps are as follows: (a) K 2 CO„ DMF, 90-100 
'C; (b) CS 2 C0 3 , KI, DMF, 90-100'C; (c) Cs 2 C0 3 , n-Bu,NI, 
DMF, 85°C; (d) Ag 2 0, n-Bu<NI or KI , THF, with or 



^-^•V#^*>^ > ' ; , V ^^^^ 
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without sonication, 20°C; (e) NaH, THF, 20°C; (f) Na, 
THF, 20°C; (g) DHP, MeOH, 1 drop HCl coned; (h) DEAD, 
PPh 3 , Phtalamide, THF, 20°C; (i) DMTrCl, pyridine, 
20°C; (j) (NH 2 ) 2 , ethanol, 80°C; (k) Tsci, pyridine, 
5 20°C; (1) Na, 23; (m) PDC, DMF, 20°C; (n) Faocci, 

Na 2 C0 3 , dioxarie-H 2 0, 0°C to 20°C; (o) (Boc) 2 0, Et 3 N, 
DMF, 0°C to 20°C. 

Figure 4 illustrates the substrates studied in 
10 the detection of bond cleavage or formation* 

Figure 5 illustrates the synthesis and use of 
the reaction cassette for the detection of a- 
chymotrypsin catalyzed bond cleavage. The top 

15 reaction shows the use of SPPS (solution phase 

peptide synthesis) on a NovaSynKD (CH 2 CH 2 NH 2 ) matrix 
with the spacer VI or VII, Fmoc-Ala 2 , Fmoc-Tyr (OSit- 
BuMe 2 ) , and spacer l. The next reaction shows the 
attachment of spacer VIII which is followed by DNA 

20 synthesis using B-cyanoethyl phosphoramidites to 

afford a fully protected reaction cassette which is 
sequentially deprotected with concentrated ammonia 

\ and TBAF followed by extensive washings* The 

cassette is then ready for use in bond cleavage 

25 detection upon exposure to a-chymotrypsin followed by 

PCR amplication and assay. 

Figure 6 illustrates the synthesis of the east 
side (S9, 810) of the reaction cassette for the 
30 detection of enzymatically and chemically catalyzed 

bond formation. 

Figure 7 illustrates the synthetic approach for 
the detection of enzyme catalyzed bond formation: 
35 Coupling of the west side (C34 or C35 - derived from 
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commercially available compounds 24 or 25) to the 
east side (8» - synthesis shown in figure 6) . The 
cassette is then ready for use in an enzyme catalyzed 
bond formation upon exposure to a-chymotrypsin 
followed by PCR amplication on a solid support and 
detection by fluorescent assay. 

Figure 8 illustrates the synthetic approach for 
the detection of chemically catalyzed bond formation: 
Coupling of the west side (C3«) to the east side 
(810) in a 60% DMSO/0.1 borate buffer pH 10 solution. 
The mixture is then reduced using 

sodiumcyanoborohydride, followed by extensive washing 
and then PCR amplication on a solid support with 
detection by a fluorescent assay. 

Figure 9 illustrates the templates studied to 
improve the polymerase chain reaction (PCR) . 
Figure 10 illustrates the general scheme of the first 
generation reaction cassettes (C1-C6) . 

Figure 11 illustrates the general scheme of the 
second generation reaction cassettes (C7-C25) . 

Figure 12 illustrates bond cleavage detection. 
PCR products of samples from the medium containing 
the reaction cassette indicated on top of each lane, 
with or without a-chymotrypsin: Lanes 1, 3, 5, and 7 
correspond to the PCR products of a sample from the 
medium containing a-chymotrypsin (5 nM) , and Cassette 
C7, CIO, C31, and C32 respectively; Lanes 2, 4, 6, 
and 8 correspond respectively to the same experiments 
without a-chymotrypsin. The positive control (Lane 9) 
corresponds to the PCR product of an authentic sample 
of the DNA 45mer. The negative control (Lane 10) 




' -12- ' 

corresponds to the same experiment without the DNA 
45mer. TG-NH 2 and NSKD-NH 2 correspond respectively to 
unfunctionalized TentaGel and NovaSyn KD resins 
bearing an amino terminus group. 

5 ' ' > 

Figure 13 illustrates the general scheme of the 
third generation reaction cassettes (C26-C32). 

Figure 14 illustrates bond formation detection. 
10 PCR products of one bead from the medium containing 

the functionalized or unfunctionalized resin (NSKD- 
VI 2 -NH 2 , TG-NH 2 , C34, C35, or C36) , with or without 
the functionalized polynucleotide (89, 810) , and with 
or without a-chymotrypsin, as indicated on top of 
15 each lane. Positive and negative controls are as in 

Figure 12. NSKD-V1 2 -NH 2 corresponds to NovaSyn KD 
resin bearincj two spacers VI and an amino terminus 
group; TG-NH 2 corresponds to TentaGel resin bearing an 
amino terminus group. 

20 

Figure 15 illustrates the chemical structure of 
spacer units (L-I to L-IV) used in the construction 
of the cassettes. 

25 Figure 16 illustrates the reaction specificity 

of a peptide reaction cassette with respect to 
indicated proteolytic enzymes. 

Figure 17 illustrates the reaction sensitivity 
30 of the peptide reaction cassette of Figure 16 as a 

function of time. 



35 
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DESIGN FEATURES OF THE CASSETTE 

The invention combines two prior art methods, 
viz.: 1. the technique of synthesizing polymers ' on 
solid support; and 2. the technique of PCR 
(Polymerase Chain Reaction) . The overall approach is 
illustrated in Figure 1. 

The central operative feature of the reaction 
cassette is the liberation (cleavage event) or 
capture (bond formation) of a polynucleotide 
containing two primers (Figure 1) . Thus, when an 
appropriately functionalized solid support (Figure 1) 
is exposed to a catalyst or a library of catalysts 
that are able to selectively cleave the reaction 
cassette at the substrate juncture, single stranded 
DNA (polynucleotide) will be released and can be 
amplified by the PCR. Furthermore, the sequence of 
the polynucleotide may be chosen in such a way that 
it reflects the nature of the substrate, so that a 
library of encoded substrates can be designed. When 
substrate libraries such as thtise are exposed to a 
library of catalysts one can identify not only the 
catalyst but also the substrate since the sequence of 
the cleaved polynucleotide encodes and thus 
identifies which substrate sequence has been cleaved. 
In addition to the above methodology, our cassette 
reaction technique allows one to follow a bond 
formation event via the inverse pathway (figure 1) . 
In this initial report we describe the application of 
our methodology to the study of enzyme catalyzed bond 
cleavage. 



Three generations of reaction cassettes are 
disclosed herein. The first generation is based on 



Controlled Pore Glass (CPG) solid support. The 
second generation is based on a 70% w/w copolymer of 
polyethyleneglycol and polystyrene (TentaGel). The 
third generation is based on a composite matrix 
(Novafeyii KD) V ' ^ * ; ' 

Hatrix Supports 

The ideal support for a reaction cassette is 
mechanically and chemically compatible with the 
synthetic techniques used to assemble the cassette 
(peptide and DNA synthesis) while also being 
accessible to the catalyst employed (enzymes, 
abzymes, etc.)* Polystyrene based matrices, for 
instance, are too hydrophobic for their use in enzyme 
catalyzed reactions while CPG was inappropriate due 
to poor chemical and mechanical stability (liberation 
of the polynucleotide-substrate conjugate leads to an 
undesired background reaction) . TentaGel was found 
to be mechanically stable and chemically compatible 
with peptide substrates and DNA synthesis. However, 
this matrix is not preferred because it is to 
hydrophobic in character and does not allow optimal 
accessibility to the enzyme. For this reason, the 
third generation cassettes were*! designed and are 
disclosed herein. NovaSyn KD resin, a composite 
matrix derived from polydimethylacrylamide gel 
retained within a macroporous structure of kieselguhr 
inorganic particles was used. This matrix is more 
hydrophilic, mechanically and chemically stable and 
more compatible with biocatalysis. (Meldal, M. ; 
Svendsen, I.; Breddam, K. ; Auzanneau, F-I. Proc. 
Natl. Acad. Sci. USA 1994, 91, 3314.) 

Spacers: 

The spacers are preferably heterobifunctional, 

chemically stable, and compatible with both substrate 
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and DNA synthesis. One end should be readily 
attachable to the substrate portion and the other 
should be equipped with a hydroxyl functionality for 
DMA synthesis. . . ' % 

5 , 

The first spacer on the cassette is located 
between the matrix and the substrate, while the 
second between the substrate and the polynucleotide 
portion (Figure 1) . The length of both spacers is 

10 crucial for the success of the methodology. 

(Nielsen, J.; Janda, D. K.; Brenner, S. J. An. Chem. 
Soc. 1993, 115, 9812.) For CPG and NovaSyn KD, a 
distance of at least 30-40 k between the resin and 
the substrate is preferred in order to avoid 

15 hindrance that may result from proximity to the 

matrix core. This is a distance corresponding to 
twice the length of spacer II. TentaGel is already 
endowed with a long polyethyleneglycol chain which 
serves as the first spacer. The length of the second 

20 spacer (between the substrate and the polynucleotide) 

is disclosed to be less important for the catalytic 
i reaction studied but was critical for DNA synthesis 
(vide infra) . 

25 Different types and combinations of spacers for 

the construction of the cassettes are disclosed in 
Figure 2. Spacers I-VII were synthesized while VIII 
and IX (Glen Research) and Z (Millipore) are 
commercially available. Spacers I and II were 

30 prepared according to literature procedures and were 

introduced in our initial design (first and second 
generation cassettes) . The synthesis of Spacer I is 
disclosed by Schaller et al. (Schaller, H.; Weimann, 
G.; Lerch, B.; Khorana, H. G. J. Am. Chem. Soc. 1963, 

35 85, 3821.) The synthesis of Spacer II is disclosed 
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by Nielson et al. (Nielsen, J.; Janda, K. D. Methods 
1994, 6, 361.) Spacer I Is the shortest spacer. In 
many instances, it is too tight for optimal 
reactivity between catalyst and substrate. Its 
5 • , preferred use is in sequence with other spacers. 

Spacer II was found to be chemically labile at the 
acetoxy bond under the strongly acidic and basic 
conditions required in SPPS (Boc-amino acid 
chemistry) and DNA synthesis respectively. 
10 (Stewart, J.; Young, J. Solid Phase Peptide 

Synthesis ; Pierce Chemical Company:' Rockford, 
Illinois, 1984.) 

Spacers III and IV have free amino groups and 

15 can be attached to a substrate with a leaving group 

or carboxy terminus. Spacers V-VII are the longest; 
they can form an amide bond with peptide substrates 
and can be attached in sequence of two or more as 
opposed to spacers I, III, and IV. Spacers V-VII 

20 were used in combination with spacers 1/ III or IV. 

Interconversion of these spacers allows for the 
synthesis of DNA at the terminal hydroxyl group 
I generated. Spacer VIII can be automatically 
introduced between the substrate and the 

25 polynucleotide on a DNA synthesizer, this provided 

that I, III or IV are already attached, or the 
substrate has a hydroxyl functionality. The 
deprotection step (hydrazine/ethanol (1/1)) required 
to quantitatively free the amino terminus of V was 

30 found to partially cleave the substrate from the 

resin. At lower hydrazine concentrations (0.2 M) the 
reaction is slow (24 hours) and incomplete (80-90%) . 
(Bertozzy, C. R. ; Bednarski, M. D. J. Org. Chem. 
1991, 56, 4326.). This spacer is preferably 

35 employed with for more stable substrates. Spacers 
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VIXI-X were used to derivatize the oligonuclieotide at 
the 3 1 end (DNA synthesizer ready) for studies on 
bond formation detection. The synthetic scheme for 
the preparation of. IXI-VXI is shown in Figure 3 . 



Substrates s 

a-Chymotrypsin is employed as an exemplary 
catalyst for the implementation of the reaction 

10 cassette. This enzyme is known to cleave amide bonds 

at the C-terminus of aromatic and hydrophobic amino 
acids such as Leucine, Methionine, Phenylalanine, 
Tyrosine. (Hess, G. P. in The Enzymes, Boyer P. D. 
(Ed.). Academic Press: New York, 1971, pp. 213-248.) 

15 This enzyme is also known to catalyze the formation 

of a peptide bond between Tyrosine methyl ester and 
the primary amine of Alanine. (Lane, J. W.; Hong, 
X.; Schwabacher, A. W. J . Am. Chejn. Soc. 1993, 115, 
2078.) Substrates 81-87 (Figure 4) were assembled 

20 according to standard Boc and/ or Fmoc methodologies. 

(Atherton, A.; Sheppard, R. C. Solid Phase Peptide 
Synthesis: A Practical Approach, Oxford University 
Press: Oxford, 1989.) 

25 The peptides 81 and 82 were used in the first 

generation, while 83-87 were used in the second and 
third generation cassettes. a-Chymotrypsin cleaves 
substrate 84 '10 fold more efficiently than S3, and 
"10 3 fold more efficiently than 81, as estimated from 

30 data obtained for similar peptides, viz., Baumann, W. 

K.; Bizzozero, S. A.; Dutler, H. Eur. J. Biochem. 
1973, 39, 381; and Fisher, G.; Bang, H. ; Berger, E.; 
Schellenberger, A. Biochimica Biophysica Acta 1984, 
791, 87. This characteristic may be employed to 

35 probe the specificity of a-chymotrypsin within our 
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cassette system. Peptide 84 is one of the best 
substrates of ct-chymotrypsin. Accordingly, this 
substrate is employed herein to exemplify a preferred 
lower limit of sensitivity of the cassette system. 
5 Peptide 87 was prepared in order to probe the 

interaction at the P4, P5; P4 1 and P5 1 sites of a- 
chymo trypsin. Peptide 88 was prepared in 5 steps 
(Figure 7) using standard solution phase peptide 
chemistry (Supplementary Material) and was used in 
10 the detection of enzyme catalyzed bond formation. 

The synthetic procedure used to assemble the cassette 
is illustrated in Figure 5 with NovaSyn KD as solid 
support using spacers I, VI (or VII) and VIII. This 
synthetic scheme is rapid and straightforward since 
15 Fmoc-Ala 217 and Fmoc-Tyr (OSit-BuMe 2 ) were used as 

building blocks in this synthesis. (Fisher, P. M. 
Tetrahedron Lett. 1992, 33 # 7605.) The first and 
second spacers are attached according to standard 
SPPS. Spacer VIII is introduced automatically on the 
20 DNA synthesizer employing the same procedure used to 

assemble the polynucleotide. Indeed, this spacer can 
be attached in sequence in one or more copies. Upon 
completion of DNA synthesis, the polynucleotide is 
deprotected, followed by a 15 min incubation in 1 M 
25 TBAF in THF to remove the silicone protecting group 

of Tyrosine. After this step the cassette is ready 
for use in the bond cleavage detection mode. 



For bond formation the units of the substrate to 
3 0 be attached on the resin are prepared separately. 

Figure 6 shows the synthesis of the east side of the 
reaction cassette: The polynucleotide-peptide 
conjugate (89) and the polynucleotide amino- 
functionalized at the 5' end (810). The 89 unit was 
35 prepared following conventional DNA synthetic 



procedures using methyl phosphoramidite monomers that 
are stable to the conditions of SPPS. The unit 810 
could be prepared in a similar manner (same monomers) 
or using Expedite p-cyanoethyl phosphoramidites 
monomers 1 (Millipbre) which are more easily 
deprotected. 

DMA synthetic procedures: 

Standard DNA synthesis was improved upon using 
CPG or polystyrene solid supports. CPG does has 
less preferred chemical and mechanical properties 
(vide supra) . Polystyrene based matrices have less 
preferred hydrophobic characteristics. 

The standard 1 pinole procedure of the 394 
Applied Biosystem DNA synthesizer (termed procedure 
A) using phosphoramidites chemistry was modified for 
the second generation cassettes (termed procedure B) 
as described previously. (Gait, M. J. 
Oligonucleotide Synthesis : A practical Approach: 
oxford University Press: New York, 1990.) Procedure B 
was further modified (termed procedure C) in order to 
adapt it to CPG, NovaSyn KD and TentaGel resins (see 
experimental section) . Table 1 shows the 
improvements in average stepwise yield (ASWY) and 
overall yield (OAY) using procedures A-C on CPG 

(cassette CI) , TentaGel (cassette CIO) and NovaSyn KD 

(cassette C29) . 



/; Table 1. 
'ftr?-VN*' : . - ASWY (average stepwise yield) and OAY (overall 

;v. yield) for DNA synthesis using different 

'» * * - * 

; ;, resins and modified procedures. 





Procedure A ' 


Procedure B * 


Procedure C e 


| CPG (CI) 


98.8%, 56.1% 


99.3%, 72.9% 


99.4%, 76.3% 


TentaGel (CIO) 


rails 6 


96.9%, 24.2% 


99.2%, 69.7% 


NovaSyn KD 


rails 4 


95.0%, 9.9% 


98.5%, 42.4% 


(C29) 









a 1 micromole standard procedure of the 394 Applied 
; ' . Biosystem DNA Synthesizer. I 

Jb Same as procedure A but modified as in reference 5* 
c Same as procedure A, but modified as in the methods 
section. 

d An ASWY < 95% is considered a failure because the 
OAY in this case is < 9*9%. 

m 
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Two other factors played a major role in the 
success of the DNA synthesis, the first being the 
length of the spacer between the substrate and 
polynucleotide and, the second being the loading of 
the resin. . In general, longer spacers between the 
substrate and the polynucleotide and lower loadings 
result in better ASWY's for DNA synthesis (see Tables 
2-4). For instance, on TentaGel with procedure B, 
the ASWY is 96.9% with I (CIO), 98.8% with VI (or 
VII) and I (C20), and 99.8% with 2(VI) and I (C21) . 
On the same resin with I, and procedure B, when the 
loading is 172.1 jimoles/g (C12) DNA synthesis fails, 
and when the loading is 59.9 /anoles/g (CIO), DNA 
synthesis works successfully with an ASWY of 96.9%. 
With high loading, the synthesis fails, regardless of 
the presence or absence of a long spacer between the 
substrate and the polynucleotide. 

In all cases studied, the yield of DNA synthesis 
after deprotection with concentrated ammonia varies 
from 24% to 100% depending on the cassette ("70% on 
average, see last column of Tables 3 and 4). Without 
the substrate portion, the deprotection is 
quantitative. Taken together with the fact that 
concentrated ammonia affects the matrix to a lesser 
extent before DNA synthesis (10-20% loss), these 
results indicate that during DNA synthesis, labile 
bonds, branching points, etc., are generated which 
upon treatment with cone, ammonia are eliminated. 
The amplitude of this side reaction may sometimes 
vary from one cassette to another. 
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A preferred synthetic strategy utilized to build 
the east side of the reaction cassette for the 
detection of enzyme catalyzed bond formation is shown 
5 in Figure 6.' This unit (89) requires the use of more 

stable ' monomers for DNA synthesis. A coupling step 
with Fmoc-Ala 2 has to be performed in the presence of 
a strong base (DIEA) . Under such conditions, the 
usual p-cyanoethyl protecting group (used in bond 

10 cleavage detection) can be lost and thus lead to 

undesired side reactions. The methyl phosphoramidite 
monomers proved to be more appropriate as a result of 
their inherent stability. For chemically catalyzed 
bond formation this limitation does not apply, and 

15 methyl or 3-cyanoethyl phosphoramidite monomers can 

be used to prepare 810 (Figure 6) . 

PCR protocols: 

Two critical aspects of the encoded reaction 
20 cassette are the sequence and length of the tag. In 

terms of the tag length, it should be as short as 
possible for a better OAY in DNA synthesis. In 
l - addition, it must be specifically amplified with a 

high sensitivity. 

25 

Several templates (A-F, Figure 9) were 
characterized under various PCR conditions, e.g., pH, 
[MgCl 2 ], [primer], [enzyme], cycles, and temperature. 
Templates A-F differ by the sequence of the primers 

30 and G/C content. These two factors are responsible 

for template tertiary structure and mispriming. Upon 
analyzing tertiary structure and self -complementary 
sequences within the template and with the primers 
using computer modeling (DNASIS, Amplify and OLIGO) , 

3 5 it was determined that the primer sequence is the 
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essential ingredient for efficient amplification. 
The reason for this is that the templates engaged are 
so short that they can be amplified at low 
temperature where misprising .occurs readily. With 
template A (1st arid 2nd generation), templates B-E, 
and template F (third generation), the sensitivity 
could not be improved below 10" 14 moles of template 
under PCR conditions previously reported without 
increasing non-specific amplification products. 



In a preferred mode, Hot Start PCR techniques 
for the third generation cassettes, such as TaqStart 
PCR (Clontech) , AmpliWax (Perkin-Elmer) , and HotWax 
(Invitrogen) . (Mullis, K. PCR Methods and App. 1991, 

15 1, 1-4; and D'Aquilla, R. ; Bechtel, L. J.; Videler, 

J. A.; Eron, J. J.; Gorczyca, P.; Kaplan, J. C. Wucl . 
Acids. Res. 1991, 13, 3749.) The latter technique is 
preferred and yield good best results. The technique 
is based on a wax containing Mg" which is added to 

20 the reaction mixture before cycling. Heating at 94 6 C 

for 30 seconds before the first cycle liberates Mg" 
in solution and allows Taq polymerase to proceed. In 
this manner mispriming that can occur at low 
temperature cannot be amplified since Mg** is 

25 liberated only at high temperature. Because of its 

low density the melted wax remains on the surface of 
the aqueous layer which also serves to prevent 
evaporation of the solution. With Template E the 
sensitivity could be improved to 10" 17 moles but the 

30 yield and specificity are poor (weak bands and non- 

specific products co-migrating with the desired 
product) . Template F gave the best results under Hot 
Start PCR conditions using HotWax and modified PCR 
protocol (see experimental section) . Now we were 

35 able to detect as little as 60-600 molecules of 
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template on agarose gel with ethydium bromide 
staining. Finally it should be mentioned that it is 
preferred to work under strictly sterile conditions 
to avoid contamination arid carry-over when such a 
5 small amount of molecules is to be detected. (Rolfs, 

A.; Schuller, i.; Finckh, U. ; Weber-Rolfs, I. PCR: 
Clinical Diagnostics and Research; Springer-Verlag: 
Berlin Heidelberg, 1992, chapter 5; and Innis, M. A.; 
Gelfand, D. H.; Sninsky, J. J.; White, T. J.; (Eds) 
10 PCX Protocols. A Guide to Methods and Applications; 

Academic Press: San Diego, 1990.) 

Bond cleavage: 

First Generation Cassette. This generation was 
15 assembled on CPG with (D) and (L) amino acids, and 

with different combinations of spacers using standard 
solid phase peptide and DNA synthesis. The ASWY for 
peptide synthesis was 95-98* , and "99% for DNA 
synthesis (see Table 2 and Figure 10). Template A 
20 (Figure 9) and a combination of spacers I and II were 

used in this first generation. 

Conventional ammonia deprotection of the 
polynucleotide leads to quantitative cleavage of the 

25 peptide-polynucleotide conjugate from the resin. 

This result was the same when Expedite 
phosphoramidites (Millipore) is employed. The 
method requires only a short exposure to concentrated 
ammonia, e.g., 1 hour instead of 16 hours. The 

30 cassette is not cleaved by the enzyme when the DNA is 

not deprotected. This result was attributed to its 
hydrophobic character when protected. Mild 
deprotection of the polynucleotide on the phosphate 
backbone (overnight incubation with DBU 0.5 M in 

35 anhydrous DMF) does not affect the reaction cassette. 
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This partial deprotection makes the cassette more 
hydrophilic and allows a-chymotrypsin to reach and 
cleave the peptide substrate of the cassette leading 
to the liberation of the polynucleotide in solution. 
5 Unfortunately the background reaction (spontaneous 

cleavage) accounts for 25% of the reaction, and makes 
this first generation cassette impractical* 

Second Generation Cassette. 

10 This generation is based on TentaGel resin. 

Several cassettes are disclosed herein, with both (L) 
and (D) amino acids, with different combinations of 
spacers and different loadings (Figure 11 and Table 
3) . As opposed to CPG and NovaSyn KD based 

15 cassettes, TentaGel is already endowed with a long 

polyethyleneglycol side arm. It is not necessary to 
introduce a spacer between the resin and the 
substrate. All TentaGel based resins with a loading 
* 80 pmoles/g were eliminated from our studies 

20 because the cassette could not be assembled in good 

yields (Table 3) . 

i 

a-Chymotrypsin cleavage of the substrate: a) in 
the presence and, Jb) in the absence of spermine (1 

25 mM) , c) with or, d) without DMSO (20%), e) with or, 

f) without BSA (1%), gr) with or, h) without single 
stranded DNA binding protein (3.2 MM final 
concentration, Promega), J) with different 
concentrations of reaction cassette, j) with 

30 different enzyme concentrations and, k) over various 

incubation periods, are disclosed herein. Spermine 
and single stranded DNA binding protein were expected 
to minimize the interaction of a-chymotrypsin 
(overall charge +5 to +6) with the polynucleotide 

35 (overall charge -45). (Blow, D. W. in The Enzymes; 
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Boyer P. D. (Ed.), Academic Press: New York, 1971, 
pp. 185-212.) Spermine did not affect the 
sensitivity of the reaction, while single stranded 
DNA binding protein inhibited the reaction. This 
5 latter finding may be due to hindered access to the 

substrate site upon binding to the polynucleotide. 
The concentration of reaction cassette as well as the 
incubation time did not improve the sensitivity, it 
only increased the background reaction. DMSO and BSA 

10 which were expected to minimize non-specific 

interactions with the matrix, also did not affect the 
sensitivity. As expected, (D) amino acid based 
cassettes were not cleaved. The length of the second 
spacer did not affect the sensitivity of the reaction 

15 cassette which in all cases studied did not exceed 

the previously reported, 0.1-1 pinoles (5-50 nM) a- 
chymo trypsin. If charge interactions are important, 
then the positively charged a -chymo trypsin may be 
sequestered in close proximity to the negatively 

20 charged substrate-polynucleotide conjugate, thus 

increasing its local concentration. This then could 
compensate for hindrance effects. With this 
generation cassette, CIO is cleaved under conditions 
were C. is not (Figure, lanes 1-4). Apparently, the 

25 reaction cassette discriminates between two very 

similar substrates differing with only one amino acid 
((L)Ala 2 -(L)Tyr-(L)Ala 2 versus (L) Ala 2 - (L)Hfi- (L) Ala 2 ) . 
The difference in reactivity of a-chymotrypsin 
towards these two substrates in solution is estimated 

30 to be *10 (see above). This result is relevant to 

the construction of encoded reaction cassettes 
libraries. 



35 



The fact that the sensitivity of the reaction 
cassette system stops at 0.1-1 pmole of a- 
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chymotrypsin' means that the uncatalyzed reaction is 
almost as fast as the reaction with 0.1-1 pinoles ot- 
chymotrypsin. At first approximation, there is no 
apparent reason for the sensitivity to reach a' 
5 minimum at this level. There' are several possible 

answers for this question: a) K n can be affected by 
the microenvironment created by the resin or the 
conformation of the substrate which may decrease the 
affinity of the enzyme for the substrate, b) the 
10 uncatalyzed reaction is abnormally fast within this 

cassette system, c) the substrate is not fully 
accessible to the enzyme (i.e. matrix is too 
hydrophobic or substrate hindered by DNA) , d) PCR 
conditions are not optimal. 

15 

For the reaction to occur, the substrate has to 
be recognized by the catalyst; K„ will thus directly 
affect the sensitivity of the reaction cassette. 
Assuming that the K» (10" s M at the best)" is 100 

20 times higher for the substrate on the solid support 

(K..pp - 10" J M) , the fraction of complex formed under 
the most favorable concentration conditions used 
(12510" 3 M of reaction cassette, 5510" 9 M a- 
chymotrypsin) would be 0.92 (4.62510" 9 M) . Assuming 

25 that k e . t (100 s" 1 at the best) is not affected by the 

microenvironment and that all complexes formed are 
productive, the velocity of the enzyme would be 
4.62510" 7 Ms" 1 (4.62510" 9 M 5 100 s" 1 ) . Under the same 
concentration conditions, the velocity of the 

30 background reaction (peptide solvolysis) may be 

approximated to 3.6510" 11 Ms" 1 (3510' 9 s" 1 5 12510" 3 M) . 
(Kahn, D.; Still, W. C. J. An. Chem. Soc. 1988, 110, 
7529.) The ratio of the velocities with and without 
enzyme is > 10 4 . According to this rough estimate, 

3 5 these encoded reaction cassettes should be able to 
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detect at least 10" 17 moles of a-chymotrypsin (0.l5lb" w 
moles 5 10" 4 ) . Therefore, the sensitivity should not 
be limited by K.. The second possibility may be ruled 
out as it would assume that the rate constant for an 
5 inactivated peptide bond sblvolysis would be 3.8510" 5 

s rl (4.62510" 7 Ms"Vl2510" 3 M) which cannot be accounted 
for by a mere microenvironment effect. The third 
possibility concerning the accessibility of the 
substrate to the enzyme was examined using C25 (Table 

10 3) and C33 (Table 4). These cassettes are lacking the 

polynucleotide portion and were used to probe the 
hindrance that may result from this unit. These 
cassettes were also chosen in order to investigate 
the enzyme's accessibility to the substrate in the 

15 second generation reaction cassettes since C33 is 

based on NovaSyn KD resin which has been shown to be 
compatible with biocatalysts. When these cassettes 
were incubated with a-chymotrypsin, C33 is 
quantitatively cleaved at the substrate portion 

20 leading to the liberation of H0-(L)Tyr-(L) Ala 4 -I-OH in 

solution (X^ - 276 nm, e - 1450 M -1 cm" 1 in 0.1 M HC1) , 
whereas C25 did not give rise to any UV detectable 
material in solution, indicating that, indeed, the 
enzyme does not have full access to the substrate on 

25 TentaGel resin, probably because of its hydrophobic 

character. The fact that we do see cleavage on 
agarose gel with TentaGel based cassettes is simply 
due to the powerful amplifying effect of the PCR. The 
fourth possibility was examined using decreasing 

30 concentrations of template A (Figure 9) used in this 

study. The sensitivity leveled out at 10" H moles of 
template. 
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Third Generation Cassette. 

The third generation cassette incorporates 




several improvements, viz.: 

a) The PGR conditions and primer sequences are 
improved in order to increase sensitivity, 
by the design and use of template F, 
5 b) The matrix material" is selected to be 

compatible with the catalyst employed so as 
to further improve the sensitivity. A 
hydrophilic matrix such as NovaSyn KD is a 
preferred material. 

10 

The synthetic procedure using standard SPPS and 
phosphoramidite chemistries for the assembly of the 
third generation cassette is shown in Figure 5. 
Template F containing 2 primers and an encoding 
15 sequence for Ala 2 -Tyr-Ala 2 served as the tag for this 

generation. 

Incubation of C2 6-C32 in the presence of d- 
chymotrypsin leads to the cleavage of the substrate 

20 portion and liberation of the polynucleotide which 

can be amplified and visualized on agarose Gel 
(Figure, for C31 for instance see lanes 5 and 6). C31 
and C32 are cleaved with the same efficiency, 
indicating that our cassette system does not 

25 discriminate between substrate differing at the P3, 

P4, P3», and P4 1 positions. This then sets a limit to 
the specificity of this generation cassette (Figure, 
lanes 7 and 8). This information is crucial for the 
design of combinatorial encoded reaction cassettes. 

30 Unexpectedly, again the lower limit of sensitivity 

did not exceed 0.1-1 pmoles of a-chymotrypsin 
obtained with TentaGel although all parameters 
(incubation time, PCR sensitivity, choice of the 
resin, length of the spacers, DNA synthesis, etc.) 

3 5 were optimized for this study. Even more 
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unanticipated were our findings with C30 that: a) the 
detection limit is the sane whether we used "0.01 mg 
(1 bead) or "10 mg of reaction cassette. Although 
difficult to explain at this stage, the fact that 1 
5 bead, leads to the sane sensitivity as 10000 beads 

will tie extremely helpful and economical in the 
screening of combinatorial libraries of substrates 
and catalysts; b) after the completion of the 
reaction, 1 /xl of the reaction media with the 
10 cassette and a -chymo trypsin can be diluted up to 10 9 

fold and still give rise after the' PGR to a band on 
agarose gel corresponding to the polynucleotide tag. 
In contrast 1 jjI from the uncatalyzed reaction media 
no longer give rise to any detectable material after 
15 a 10 3 fold dilution. This experiment clearly 

delineates that 0.1-1 pmoles of the enzyme produces 
~10 £ times more polynucleotide in solution than the 
background reaction and yet, it is the lower 
concentration of a-chymotrypsin that can be detected 
20 over the background. Although mathematically it 

should be possible to detect enzyme amounts 2 4 510" 21 
moles (0.1510" 12 moles 5 1/10 6 5 1/25 « 2400 
molecules) , experimentally, this amount cannot be 
detected unless a-chymotrypsin remains at a 
25 concentration * 5 nM. (0.1510" 12 moles corresponds to 

the lowest detectable amount of a-chymotrypsin; 1/10 6 
corresponds to the dilution factor of the reaction 
media that still can give rise to a signal over 
background on agarose gel; 1/25 comes from the fact 
30 that only 1 /xl of the reaction media (25 Ml) was used 

for amplification by the PCR.) Possibly the enzyme 
interacts strongly with the polynucleotide and cannot 
reach the substrate until all sites are saturated. 
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Bond formation detections • 

The east (89) and west (C34) components of the 
reaction cassette were prepared as shown in Figure 6 
and 7 respectively,, in semi-organic media, a- 
5 ' '* chymbtrypsin catalyzes amide bond formation, between 
' Tyrosine methyl esters and a free amine. The reaction 
proceeds through the formation of an acyl-enzyme 
which in the semi-organic medium is attacked by the 
amine. (Wong, C.-H.; Whitesides, G. M. Enzymes in 

10 Synthetic Organic Chemistry; Baldwin, J. E. , FRS; 

Magnus, P. D., FRS; (Eds.); Tetrahedron Organic 
Chemistry Series; Volume 12; Pergamon: Oxford; 1994.) 
Figure 14 shows an agarose gel of the PCR products of 
1 bead taken from the reaction of C34 (or C35) with 

15 89 in the presence of a-chymotrypsin. When NovaSyn KD 

(C34) is used as the matrix, the background reaction 
overshadows the catalyzed reaction (Figure 14, lanes 
7-9) • A control experiment with the resin alone 
(without the substrate portion) gives the same 

20 results which indicates that the background reaction 

is actually due to the adsorption of the 
polynucleotide onto the matrix, which cannot be 
eliminated completely even after extensive washing, 
but can be released in the PCR reaction media at high 

25 temperatures (lanes 1-3). With TentaGel based resins 

(C35), the background reaction could be eliminated 
after extensive washings (lanes 4-6). The reaction in 
the presence of the catalyst gave rise to a weaker 
band when compared to the background reaction 

30 (uncatalyzed amide bond formation), lanes 10-12. This 

trend is at variance with our expectations but can be 
explained as follows: a) a-chymotrypsin catalyzes the 
hydrolysis of the Tyrosine methyl ester of C34 (or 
C35) , thus lowering the substrate concentration 

35 available to react with the polynucleotide-peptide 




conjugate. The uncatalyzed reaction gave a stronger 
signal (lane 10) because it is not hampered by this 
fact; Jb; ot-chymotrypsin forms a stable aicyl-enzyme 
intermediate with the most accessible sites, blocking 
5 the passage to the uncatalyzed reaction between the 

polynucleotide-peptide conjugate and the resin bound 
substrate. 

Whatever the reason, it is disclosed herein that 
one can detect the slow-uncatalyzed peptide bond 
formation with encoded reaction cassette disclosed 
herein. At the same time, the encoded reaction 
cassette is disclosed to be able to detect the 
formation of chemical bonds. These results also show 
that with the present design, the size of the 
catalyst seems to handicap the methodology. For this 
reason, one can employ, a chemically catalyzed bond 
formation using NaCNBH 3 , as disclosed as follows. 

20 Detection of inline bond formation: 

The synthetic scheme is shown in Figure 8. We 
have used TentaGel for reasons discussed (vide 
supra) . The matrix is firkt activated with 
glutaraldehyde and then coupled with 810 followed by 

25 NaCNBHj reduction. After extensive washing, one bead 

is submitted to the PCR. Figure 14 shows an agarose 
gel that demonstrates the formation of the imine 
bond. The resin that was pre-treated with 
glutaraldehyde give rise to the 4 5-mer band (lane 

30 13), whereas, as expected, the resin that was not 

submitted to this pre-treatment, did not give rise to 
the 4 5-mer (lane 14) . Apart from its utility for the 
demonstration of the second part of the principle of 
the reaction cassette, this synthetic methodology may 

35 be used for the construction of an encoded library of 
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reaction cassettes from a library of peptide- 
polynucleotide conjugates library that was obtained 
beforehand via an orthogonal synthetic scheme. The 
library thus obtained could be used in the bond 
5 , cleavage detection node. ■ - 
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AHhrpviations: 

DMTr (dimethoxytrityl) MMTr (monomethoxytrityl) , 
Fmoc (fiuorenylmethoxycarbonyl) , Boc (tert- 
butyloxycarbonyl) , DCM (dichloromethane), DMF 
( dime thy 1 formami de ) , DMA (dimethylacetamide) , ASWY 
(average stepwise yield), OAY (overall yield)/ DIEA 
(diisopropylethylamine) , TBAF (tetrabutylammonium. 
fluoride), THF (tetrahydrofurane) , MeOH (methanol), 
dH 2 0 (distilled water) , AcONa (sodium acetate) , SPPS 
(solid phase peptide synthesis), t-Bu ( tert-butyl) , 
coned (concentrated), CPG (Controlled Pore Glass). 



funeral 

NMR spectra were recorded on a Bruker 300 MHz 
for 'H NMR and 500 MHz for 13 C NMR, with the solvent 
as internal reference. All l H and I3 C NMR's in D 2 0 were 
performed with (2-methyl) 2-propanol as internal 
reference (CH 3 , 1.36 ppm ; HO£(CHj) 3 , 68.7 and 31.6 
ppm) . Melting points were measured on a Fisher-Johns 
Melting Point Apparatus. Chromatographic support was 
Silica flash Merck 60 (0.040-0.063 mm). The mass 
spectra were performed at the Mass Spectrometry 
Facility of The Scripps Research Institute. 

synthesis of suhstr atP* as illustrated in figure 4: 

Fmoc-Ala 2 and Fmoc-Tyr (OSi t-BuMe 2 ) were 
synthesized according to a procedure described in: 
Atherton, A.; Sheppard, R. C. Soiid Phase Peptide 
Synthesis: A Practical Approach; Oxford University 
Press: Oxford, 1989, pp. 47-53. See Supplementaary 
Material for Fmoc-Ala 2 and Fisher, P. M. Tetrahedron 
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1 Lett. 1992, 33, 7605 for Fmoc-Tyr (0Sit-BuMe 2 ) . The 

other amino-acids, HBTU (O-benzotriazol-lyl- 
N, N, N' , N' -tetramethyl-uronium hexaf luorphosphate) and 
HOBt (l-hydroxybenzotriazoie hydrate) are 
; commercially available from Novabiochem; glutaric 

6 anhydride, DIEA (diisopropylethylamine) , anhydrous 

DMA (dimethylamine) and DMF (dimethyl formamide) from 
Aldrich; solvents (DCM (methylene chloride), MeOH 
(methanol), THF (tetrahydrofuran) ) from Baxter were 
all HPLC (high performance liquid chromatography) 
11 grade with low water contents (< 0.001%). Before use, 

the resins are preconditioned with 3% C1 3 CC0 2 H/DCM 
treatment for 10 min followed by extensive DCM, DMF, 
10% DIEA/ DMF, DMF and DCM washes. 

16 Synthesis of substrates (S1-S7: sequenc e IDNO.'s i- 

5) as illustrated in figure 4: 

The peptide substrates (S1-S7; SEQUENCE ID NO. ' s 
1-5) were assembled manually on either a 500 A CPG 
(controlled pore glass) (-50 ^moles/g, Sigma or CPG 

21 Inc.), TentaGel-S-NH 2 support (-260 /tfnoles/g, 8 

Novabiochem or Rapp Polymere), or NovaSyn KD (-100 
Amioles/g, Novabiochem) according to standard Ftnoc 
and/or Boc methodologies (Stewart et al. Solid Phase 
Peptide Synthesis; Pierce Chemical Company: Rockford, 

26 Illinois, 1984; Atherton et al. Solid Phase Peptide 

Synthesis: A Practical Approach, Oxford University 
Press: Oxford, 1989). To have a final loading of 40- 
60 ^moles/g, the first step of the synthesis is 
performed with an excess of solid support to amino 

31 acid monomer; after all the final washings, the 

unreacted amino groups are capped (0.25 volume of 
acetic anhydride 4.23 M in 2, 6-lutidine; 0.75 volume 
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1 of N,N-dimethylaminopyridine, 0.53 M in THF, 3 min) . 

The same procedure of partial functionalizatipn and 
capping applies for other loadings. The ASWY for 
peptide synthesis was 95-98%. After each coupling 
'. step the resin is washed with DMF, DCM and MeOH. The , 
6 completion of the reaction is controlled with the 

Kaiser test (Kaiser et al. Anal. Biochem. 1910, 34, 
595) . The loading is determined when possible using 
either the DMTr (Dimethoxytrityl) cation assay (Gait 
et al. Oligonucleotide Synthesis : A Practical 
11 Approach; Oxford University Press: Oxford, 1990, p. 

48 or the fulvene-piperidine adduct assay (Ftnoc 
assay; e 302 n « ° 7800 M" 1 cm" 1 in piperidine/DMF 20%; 
Atherton et al. Solid Phase Peptide Synthesis: A 
Practical Approach, Oxford University Press: Oxford/ 
16 1989) . 

The t-BuMe 2 Si protecting group of the hydroxyl 
moiety on Tyrosine, as illustrated in figure 5, was 
removed at the end of the cassette assembly by 
treatment with 1 M TBAF in THF followed by extensive 
21 washing with THF, MeOH, dH 2 0, tris-HCl buffer (20 mM, 

pH 8, NaCl 160 mM) and dH 2 0. After this step the 
reaction cassette is ready for use in the bond 
cleavage detection mode. 

The substrate S8 (SEQUENCE ID No. 6, figure 4) 
26 was prepared in 5 steps using standard solution phase 

peptide chemistry (vida infra) . Since this compound 
tends to form a gel in fairly dilute solutions, its 
coupling either to TentaGel or NovaSyn KD was 
performed as follows: NovaSyn KD (0.5 g, 50 /imoles),) 
31 HBTU (50 mM) (O-benzotriazol-lyl-N, N, N' ,N' - 

tetramethyl-uronium hexaf luorphosphate) and HOBt (50 
mM) (1-hydroxybenzotriazole hydrate), DIEA (200 mM) , 
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88 (SEQUNCE ID No. 6) (50 mM) in DMA (3 ml) . the . 
mixtures are shaken at 20"C for 8-16 h. These 
conditions correspond to standard conditions diluted 
2-4 times. The resin -is then washed with .DMA 
(dimethyl amine) / MeOH (methanol) and DCM (methylene 
chloride). Tyr(O-t-Bu) is deprotected with T FA/ DCM 
(trifluoroacetic acid/methylene chloride) (1/4) for 
10 min; the resin is washed with DCM, DMF, MeOH and 
DCM, and dried under high vacuum. The resin is now 
ready for use in bond formation detection in 
combination with the other part of the substrate as 
illustrated in Figure 7. For TentaGel the coupling 
was performed as follows: TentaGel (0.25 g, 65 
fjmoles) , HBTU (65 mM) , HOBt (65 mM) , 88 (SEQUNCE ID 
No. 6) (65 mM), DIEA (260 mM) in DMA (3 ml). Shake at 
20°C for 16 h. For the preparation of 89 (SEQUNCE ID 
No. 6) and S10 (figure 6) see DNA synthesis vida 
infra. 



IPS 



DNA Synthesis: 

21 All monomers, reagents and solvents for DNA 

synthesis were purchased from Applied Biosystem (ABI) 
or Glen Research unless otherwise indicated. Expedite 
Q-cyanoethyl phosphoramidites (Millipore) are 
protected with t-butylphenoxyacetyl on the amino 

26 groups of Adenine, Guanine, and Cytidine; Thymidine 

is not protected; the phosphate backbone generated 
with these monomers is protected as fi-cyanoethyl 
ester. The phosphoramidite monomers used with a few 
of the second generation cassettes require incubation 

31 in concentrated ammonia in a sealed tube at 55°C for 

16-20 h. In order to minimize this exposure we used 
Expedite phosphoramidites with most of the reaction 
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I .' cassettes which ailowed .for, rapid deprotection (1 

hour at 55°C in coned ammonia) . 

DNA synthesis was carried out on a 394 Applied 
Biosysteni DNA Synthesizer using standard 
• • phosphoramidite chemistry (Gait et,al. 

6 Oligonucleotide Synthesis, ,: A practical Approach: 

Oxford University Press: New York, 1990). The 
standard 1 nmole cycle (termed Procedure A, 97 steps) 
was modified as described earlier for the synthesis 
. on TentaGel (termed Procedure B, 97 steps as 

.1 described in (Fenniri et al. Proc. Natl. Acad. Sci. 

USA 1995, 92, 2278) . It was further modified in 
order to adapt it to both TentaGel and NovaSyn KD 
resins (termed Procedure C, 99 steps) as follows, in 
this order: a) DCM (methylene chloride) bottle 

L6 (bottle 19) was replaced with anhydrous DMF; b) Cap A 

(bottle 11) was replaced by Expedite Cap A 
(Millipore) ; c) bottle 15 (iodine 0.1 M) was replaced 
by a less concentrated one (0.02 M) ; d) in the 
monitoring mode, step 77, 18 to column for 10 s, was 

II replaced by 19 to column for 35 s; e) in the non- 
monitoring mode, one washing step was added, 19 to 
column for 35 s (step 80); f) in the non-monitoring 
mode, another washing step was added, 19 to column 
for 35 s (step 94); g) In the non-monitoring mode, 

26 steps 84, 87, and 90 were prolonged from 5 s to 10 s; 

h) all washing steps 3, 59, 61, 66, and 96 were 

prolonged from 10 s to 30 s; i) the incubation time 
~ ""with phosphoramidite and" tetrazole (step 45) was 

prolonged from 25 s to 120 s; for the introduction of 
31 spacer VIII this step was extended to 900 s; j) the 

concentration of all monomers used on the DNA 
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synthesiser was 0.1 M in anhydrous acetonitrile; k) 
synthesis was monitored every 3 to 5 couplings. 

The polynucleotide is deprotected upon treatment 
with coned NH 3 f or , 1 hour at 55°C. The EMTr group is 
removed upon treatment with 3% C1 3 CC0 2 H in DCM (5 
min) , followed by extensive washing with DCM, THF, 
Me OH, tris-HCl buffer (20 mM, pH 8, NaCl 160 mM) , and 
dH 2 0. Tyrosine containing cassettes are treated for 15 
min with 1 M TBAF in THF to remove the phenolic 
hydroxyl protecting group SiMe 2 t-Bu and washed as 
above. After this step, the cassette is ready to use 
in the bond cleavage detection mode. 



16 



Substrate S9 used for the detection of enzymatic 
catalyzed bond formation (figure 6): 



45N« 0 0 
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For enzyme catalyzed bond formation detection, 
S9 was built on CPG 1000 A (1 pinole scale) provided 
with a long chain alkylamino group bearing the first 
21 methyl phosphoramidite as follows: Standard 1 //mole 

cycle (procedure A) of the 394 Applied Biosystem DNA 
synthesizer was used with methyl phosphoramidites and 
"V standard ABI reagents. After DNA synthesis, spacer 
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VIII was introduced followed by spacer IX or, X 
(Millipore) . The MMTr amino-protecting group was 
removed upon treatment with 80% AcOH for 1 hour at 
20°C followed by washing with dH 2 0, tris-HCl buffer 
(20 mM, pH 8, NaCl 160 mM) , dH 2 0, MeOH, DCM, and dried 
under high vacuum. The functionalized polynucleotide 



thus obtained is further derivatised with Fmoc-Ala 2 
while still on the solid support following standard 
SPPS under the following concentration conditions: 
for 4 Minoles of resin bound polynucleotide, HBTU (25 
L mM) , DIEA (100 mM) , Fmoc-Ala 2 (25 mM) , DMA (0.5 ml), 

shake at 20°C for 2 h. The resin is washed with DMF, 
MeOH, DCM and dried under high vacuum. After this 
step the phosphate backbone is deprotected using a 
mixture of thiophenol/Et 3 N/dioxane (1/1/2) for 45 min 
6 and washed with dioxane and MeOH. The peptide- 

. polynucleotide conjugate. is detached from the resin 
and deprotected on the terminal Fmoc-amino acid group 
(1 hour in concentrated ammonia at 55°C) and filtered 
through 0.8 /am disposable syringe filter (Corning). 
1 After this step, the polynucleotide-peptide conjugate 

is deprotected on the nucleobases upon treatment for 
18 hours at 45-50°C. The solution is lyophilized and 
ethanol precipitated 3 times from 3 M AcONa pH 5.2 
and used without further purification for enzyme 
6 catalyzed peptide coupling. 
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For detection of chemically catalyze k „ 
formation, sio (the ™i , catalyzed bond 
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<™ u Pon treat , ent ; r h ™ gr0UP is 

followed by extensive u h ° Ur at 20 °C 

supernatant is recover^ * , The 
coupling reaction. the 




The cassette (O.oi-io ma s o in ' iL: 
suspended in 25 „1 tris-HdT,, " 1 " 0l " /9) 
» *) contalnin 1 ., "^T " « «*' «* Nacl 
0 units of ^t^l: an 't„ 10 "'' 10 ' ! ' 10 " " 

^ 20-c. supernatant nuL a 2 r tU " "** 

3 ° "in. 1, 2 , 3 , 4 " ' 2 " 1J W " e take n after 

submitted to the PCR ' ^ Snd " h ' - -« 




A few hundred beads (~5 mg) of C34 or C35 were 
placed in several wells of a 96-well filtration plate 
assembly (0.65 mm, Hydrophilic Durapore, Low Protein 
Affinity, Millipore) and were incubated with S9 (0.32 
mM) in tris-HCl 100 mM pH-8/DMSO (40/60, final pH 
8.75) in the presence of 1, 0.1, or 0 units of a- 
chymotrypsin (final volume 200 nl) ; the plate is 
shaken for 1 hour and is adapted to a Multiscreen 
Filtration System (Millipore) which allows the 
reaction to be stopped by applying vacuum to the 
device followed by extensive washings (DMS0/dH 2 0 60%, 
dH 2 0, AcOH 80% (10 min), dH 2 0, NH 3 coned (10 min), 
dH 2 0, TFA 2% (10 min), NH 3 coned, dH ? 0 80°C, MeOH) . One 
bead is taken from each well for PCR amplification. 
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A few hundred beads of TentaGel (-5 mg) were 
placed in several wells of a 96-well filtration plate 
assembly (0.65 mm, Hydrophilic Durapore, Low Protein 
Affinity, Millipore) and were incubated with 20% v/v 
glutaraldehyde/0.1 M phosphate buffer, pH 8.5. The 
plate was shaken at room temperature for 3 hours and 
then the beads extensively washed with dH 2 0 and 0.1 M 
phosphate buffer, pH 8.5. The amino- functionalized 
polynucleotide S10 (0.15 mM) was added in 0.05 M 
borate buffer pH 10 containing 60% DMSO. The plate is 
shaken for 20 hours at 37°C, it was then adapted to 
the Multiscreen Filtration System which allows the 
reaction to be stopped by applying vacuum to the 
device, followed by extensive washing with 0.1 M 
borate buffer pH 10, then 0.1 M phosphate buffer pH 
8.5. 200 nl 0.2 M NaCNBHj in 0.1 M phosphate buffer, 
pH 8.5 was added to each well, and the plate was 
shaken for 3 hours at 20*C and stored at 4°C overnight 
without shaking. The liquid phase was filtered off 
the wells using the Multiscreen Filtration System, 
and the resin was washed extensively with dH 2 0, 80% 
AcOH (2 5 10 min), NH 3 coned (2 5 10 min) , TFA 2% (2 5 
10 min), NHj coned (2 5 10 min), dH 2 0 80 o C, and MeOH. 
One bead from each well was taken for PCR 
amplification . 
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PCn Experiments: 

All the templates; SEQUENCE ID No.'s 8-14 (A-G, 
Figure 9) and corresponding primers were custom made 
via Operon Corporation. For the reaction cassettes 
with template A (SEQUENCE ID No.l), aliquots (1 nl) 
from the reaction mixture were mixed with the PCR 
components: MgCl 2 " 2.5 mM ( Pr omega ) , 1.2 ills Taq buffer 
(Promega)/ 2 fil; deoxynucleotide triphosphates 2.5 mM 
(Pharmacia), 1.6 fil; primer I 100 pmoles//il, 1 ill; 
Primer II 100 pmoles/^l/ 1 1*1; cffl 2 0, 17.7 nit Taq 
polymerase (Promega), 0.5 ^1 (2.5 U), was added just 
before starting the first PCR cycle. A positive 
control (PCR components only) was run with dH 2 0 
containing 1 pmole of the polynucleotide sequence 
used in this study. A negative control was run under 
the same conditions without the polynucleotide 
sequence. The PCR was run. on a Perkin-Elmer-Cetus 
9600 instrument with the following cycle program: 
denaturation 94°C, 30 s; annealing 55°C, 30 s; 
extension 72°C, 30 s. After 35 cycles the results 
were analysed on agarose gels (1% Gibco-BRL agarose, 
2% FMC NuSieve GTG igarose with 90 mM Tris/64.6 mM 
borate/2.5 mM EDTA, pH 8.3 (15 TBE) at 103 mV] . For 
the reaction cassettes with template E (SEQUENCE ID 
No. 13) (Table 4) we used HotWax low concentration Mg 44 
beads (1.5 mM final concentration in a 50 fil final 
volume, Invitrogen); Taq buffer (Promega)/ 5 til; dNTP 
2.5 mM (Pharmacia), 4 til; Primer I 100 pmoles//U, 2.5 
111; Primer II 100 pmoles/Ml, 2.5 ill; Taq polymerase 
(Promega) 1 ill (5 U); reaction supernatant fluid, 1 
111; dH 2 0, 34 ill. The cycle program was preceded by a 
pre-heating step at 94°C for 30 s and ended by an 
extension at 72°C for 10 min. 35 Cycles were 
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pirogrammed as follows: denaturation 94°C, 30 s; 
annealing 49°C, 1 min; extension 72°C, 30 s. Negative 
and positive controls as well as analysis on agarose 
gel were performed, as above. For bond formation 
detection,, template F ( (SEQUENCE ID No. 14) was used 
as a Tag, and PCR conditions were identical to those 
described above with this template but with 25 cycles 
only. 
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Preparation of spacer molecules (represented in prior 
art and/or commercial! v available compounds); 

The compounds I (Shaller et al. J. Am, Chem. 
Soc. 1963, 85, 3821), II (Nielsen et al. Methods 
1994, 6, 361), Fmoc-Tyr (OSit-BuMe 2 ) (Fisher, P. M. 
Tetrahedron Lett. 1992, 33, 7605), 2; 21 (Morin.et 
al. Tetrahedron 1992, 48, 9277), and 3; 22 (Prakash 
et al. J. Chem. Soc. Perkin Trans. 1 1991, 1273) are 
prepared according to previousely reported 
procedures. The compounds 19, 20, 23, 24, 25 and all 
the reagents and solvents are commercially available 
from Aldrich. Several attempts to derivatize 1, 5 and 
9 with 19, 20, 21 or 22 under various conditions 
failed completely or gave only a poor yield of the 
desired product. In this section we describe the 
reaction that led to spacers III-VII in reasonably 
high yield (figure 3) . The spacers IX (Glen 
Research) and X (Millipore Research) were 
commercially available. 
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Confound 5: Compound 1 (25.0 g, 89 mMol; Sigma 
Chemical company), 3, 4-dihydro-2ff-pyran (7.5 g, 89 
mMol; Aldrich), in methylene chloride (500 ml) were 
cooled to 0°C and stirred with few drops of 
concentrated HC1 for 1 h. The temperature was raised 
to 20°C and the stirring was maintained overnight. 
NajCOj (3 g) was added and the suspension was stirred 
for 1 hour then filtered, followed by evaporation of 
the solvent to dryness and flash chromatography (Si0 2 , 
ethylacetate/methylene chloride 0-100%) to afford 
compound 5 (C 17 H 34 0 9 , 9. 8 g, 30%) which was obtained as 
a viscous colorless oil. Rf = 0.1 (Si0 2 , ethyl 
acetate). 'H NMR (CDC1 3 ) 6: 4.56 (t, 3 J(H,H) = 2.9 Hz, 
1H, OCHO) ; 3.9-3.3 (m, 28H, CH 2 OCH 2 and CifcOH) ; 1.65- 
1.42 (m, 6H, CH 2 CH 2 CH 2 of THP) . n C NMR (CDCl,) 6: 98.1 
(OCHO); 72.1, 70.1, 70.0, 69.9, 69.8, 66.1, 61.5, 
61.0, (CH 2 OCH 2 and CH 2 0H) ; 29.7, 24.5, 18.4 
(CH 2 CH 2 CH2CH 2 0) . FAB+MS (NBA/Nal): 367 (M+H')/z; 389 
(M+Na*)/Z. 




BNSOOCID: <WO 9622391 A t_l_> 




-53- 



Svnthesis of l-hvdroxv-17-phfcaliffiido-3. 6. 9. 1 7 . 1 V 
pentaoxaheptadeeane <9) figure 3: 
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Compound 9: The compound 1 (25.0 g, 89 mMol; Sigma 
chemical company)/ phtalamide (12.9 g, 89 mMol)/ PPh, 
(23.0 g, 89 mMol) and anhydrous THF (450 ml, 
distilled over Na) were stirred at 20°C under inert 
atmosphere. DEAD (15.3 g, 89 mMol; 
diethylazodicarboxylate is commercially available 
from Aldrich) in anhydrous THF (100 ml) was added 
dropwise over 2 hours at 20°C and the reaction 
mixture was stirred overnight. The solvent was 
evaporated to dryness, the residue was taken in 
boiling dH 2 0. After cooling down, the precipitate was 
filtered and the aqueous solution was evaporated to 
dryness under high vacuum. The viscous oil thus 
obtained was chromatographed (Si0 2 flash, EA/hexane 
50-100%) yielding 9 as a colorless viscous oil 
(C 20 H 29 NO 8 , 22.0 g, 60%). Rf = 0.13 (Si0 2 , EA) . *H NMR 
(CDClj) 5: 7.82 (m, 2H, Ar) ; 7.69 (m, 2H, Ar) ; 3.88 
(t, 3 J(H,H) = 6.1 Hz, 2H, NCH 2 ); 3.72 (t, 3 J(H,H) - 
5.4 Hz, 2H, NCH 2 CH 2 0) ; 3.69-3.54 (m, 22H, CH 2 OCH 2 and 
CH 2 OH) . "C NMR (CDC1 3 ) 6: 167.9 (CO); 133.7, 131.7, 
122.9 (Ar); 72.2, 70.3, 70.2, 70.1, 70.0, 69.95, 69.7 
(OCH 2 CH 2 0) ; 67.5 (0£H 2 CH 2 N) ; 61.2 (OCH 2 £H 2 OH) ; 36.9 
(£H 2 N) . 

FAB+MS (NBA/Nal ) : Expected exact mass for (M+H*)/z 
412.1971, observed 412.1983; 434 (M+Na 4 )/z. 
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gymi-hasls o f 1 -nht-«1 Imi do- 17- f (4 . 4 1 -hi SfflethQXV" 
t-ri t v n -oxyl -3.6.9 - 17.1 S-pRnfanxaheptarienane (12) 
f inure 3: 



Confound 12: Compound 9 (1.8 g, 4.4 mMol), DMTrCl 
(4,4 dimethoxytrityl chloride is commercially 
available from Aldrich) (2.2 g, 6.6 mMol) were 
stirred in pyridine (20 ml) for 40 hours at 20 °C. 
Et 3 N (12 mMol, triethyl amine) was added and the 
solvent was evaporated to dryness. 12 (C 4 ,H« 7 NO 10 , 2.8 
g, 88%) was obtained after- flash chromatography (Si0 2 , 
EA/DCM 0-50%) as a yellow viscous oil. Rf = 0.15 
(SiOj, EA/hexane 50%). *H NMR (CDC1 3 ) 6: 7.83 (m, 2H, 
phtalimide) ; 7.68 (m, 2H, phtalimide) ; 7.47 (d, 
3 J(H,H) = 7.1 Hz, 2H, Ph DMTr) ; 7.34 (d, 3 J(H,H) = 8.9 
Hz, 4H, p-MeOPh DMTr); 7.26 (t, 3 J(H,H) - 7.6 Hz, 2H, 
Ph DMTr); 7.18 (t, 3 J(H,H) = 7.3 Hz, 1H, Ph DMTr); 
6.81 (d, 3 J(H,H) = 8.9 Hz, 4H, p-MeOPh DMTr); 3.88 
(t, 3 J(H,H) =5.9 Hz, 2H, CH 2 N) ; 3.77 (s, 6H, CH 3 0) ; 
3.75-3.5 (m, 20H, CH 2 OCH 2 ) ; 3.21 (t, 3 J(H,H) = 5.4 Hz, 
2H, CH 2 ODMTr) . "C NMR (CDC1 3 ) 5: 168.2 (CO); 158.3, 
145.0, 136.3, 130.0, 128.1, 127.7, 126.6, 123.2, 
113.0 (DMTr Ar); 133.9, 132.2, 123.2 (phtalimide Ar) ; 
85.8 ( (p-MeOPh) 2 Ph£0) ; 70.7, 70.6, 70.55, 70.5, 70.45, 
70.0, 67.8 (CH 2 OCH 2 ); 63.1 (CH 2 ODMTr) ; 55.1 (CH 3 0) ; 
37.2 (CH 2 N) . FAB+MS (NBA/Nal): Expected exact mass 
for (M+Na*)/z 736.3098, observed 736.3123. 
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Compound III: Compound 12 (1.7 g, 2.3 mMol), (NH 2 ) 2 
6 (hydrazine) (23.4 mMol, 0.73 ml) and EtOH (100 ml) 

were refluxed for 3 h. After cooling down, the 
precipitate was filtered and the solvent evaporated 
to dryness. The residual oil was taken in 
diethylether and the precipitate was filtered off. 

11 The organic phase was evaporated to dryness yielding 

III (C3jH 45 N0 8/ 1.2 g, 89%) as a viscous pale yellow 
oil which, was used without further purification. >H 
NMR (CDC1 3 ) 6: 7.45 (d, 3 J(H,H) - 7.3 Hz, 2H, Ph 
DMTr) ; 7.33 (d, 3 J(H,H) = 8.8 Hz, 4H, p-MeOPh DMTr); 

16 7.27 (t, 3 J(H,H) - 7.0 Hz, 2H, Ph DMTr); 7.18 (t, 

3 J(H,H) = 7.0 Hz, 1H, Ph DMTr); 6.81 (d, 3 J(H,H) =8.8 
Hz, 4H, p-MeOPh DMTr); 3.77 (s, 6H, CH 3 0) ; 3.73-3.59 
(m, 18H, OCH 2 CH 2 d) ; 3.48 (t, 3 J(H,H) = 5.1 Hz, 2H, 
CHjCHjNHj); 3.21 (t, 3 J(H,H) = 5.2 Hz, 2H, CH 2 ODMTr) ; 

21 2.84 (t, 3 J(H,H) = 5.1 Hz, 2H, CHjC&NHj) . 13 C NMR 

(CDClj) 6: 158.3, 145.0, 136.3, 130.0, 128.1, 127.7, 
126.6, 113.0 (Ar); 85.8 ( (p-MeOPh) 2 Ph£0) ; 73.4, 70.7, 
70.6, 70.5, 70.2 (CH 2 OCH 2 ) ; 63.1 (CH 2 0DMTr) ; 55.2 
(CH 3 0) ; 41.7 (CH 2 NH 2 ) . 

26 FAB+MS (NBA/Nal): 584 (M+H*)/z; expected exact mass 

for (M+Na*)/z 606.3043, observed 606.3072. 
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1 synthesis of 1 -hydroxy- 17- f (4 . 4 ' - 

hi smPthoxvt-rl tvl 1 oxvl -3.6.9.12.15- 

ppni-aoxaheptadecam>H31 (formed in step i. figure 3: 
compound is an inrormeriiate and not shown > i 
Compound 13: Compound 1 (5 g, 17.7 mMol) and DMTrCl 
6 dimethoxytrityl chloride (6.3 g, 17.7 mMol) were 

stirred in pyridine (17 ml) under inert atmosphere at 
20°C for 48 h. Et,N (4 ml) was added and the solvent 
was evaporated to dryness. The oily residue thus 
obtained was taken in ether (100 ml) and extracted 

11 with dH 2 0 (100 ml) . The organic layer was dried over 

Na 2 S0«, filtered, evaporated to dryness, and dried 
under high vacuum. 13 (C 33 H u O„ 3.3 g, 32%) was 
obtained as a yellow oil after flash chromatography 
(Si0 2 , EA/hexane 50-100%). Rf = 0.12 (Si0 2 , EA) . *H 

16 NMR (CDClj filtered on basic A1 2 0 3 ) 5: 7.46 (d, } J(H,H) 

= 7.2 Hz, 2H, Ph DMTr); 7.34 (d, 3 J(H,H) - 8.8 Hz, 4H, 
p-MeOPh DMTr); 7.27 (t, 3 J(H,H) = 7.8 Hz, 2H, Ph 
DMTr); 7.19 (t, 3 J(H,H) =7.2 Hz, 1H, Ph DMTr); 6.82 
(d, 3 J(H,H) =8.9 Hz, 4H, p-MeOPh DMTr); 3.78 (s, 6H, 

21 CH 3 0) ; 3.7-3.57 (m, 22H, CH 2 OCH 2 and CH 2 OH) ; 3.22 (t, 

3 J(H,H) = 5.3 Hz, 2H, CH 2 ODMTr) . n C NMR (CDC1 3 
filtered on basic A1 2 0 3 ) 6: 158.3, 145.1, 136.3, 
130.1, 128.2, 127.7, 126.6, 113.0 (Ar) ; 85.9 ( (p- 
MeOPh) 2 Ph£0) ; 72.5, 70.7, 70.68, 70.65, 70.62, 70.58, 

26 70.52, 70.48, 70.45, 70.42, 70.3 (CH 2 OCH 2 ) ; 63.1 

(CH 2 0DMTr) ; 61.7 (CH,OH) ; 55.2 (CH,0) . 
FAB+MS (NBA/Csl): expected exact mass for (M+Cs")/z 
717.2040, observed 717.2022. 



GNSOOCia <wro_M223»iAi.i.> 



-57- 

i i , 

1 synfh»sis of 1.17-bisf (n-tnlueriftsiil f onvl ) oxvl - 

vfi.q.i?.i5-ppntaoxahPnt«decane (14) (step *K' figure 

rnmnoiind is an intermf>d1 ate and not Shown) ; 
Compound 14: Compound 1 (25.0 g, 89 mMol), p- 
toiuenesulfohyl chloride (50.9 g, 267 mMol) were 
6 stirred in dry pyridine (75 ml) under inert 

atmosphere at 0°C for 4 h. The reaction media was 
then poured on ice (800 ml) and stirred vigorously. 
DCM (300 ml) was added and the aqueous layer was 
carefully acidified to pH ~1 with 3 MHC1. The 

11 aqueous layer was further extracted with DCM (2 5 300 

ml) and the organic layers were combined and 
extracted with saturated NH«C1 (2 5 300 ml), dH 2 0 (2 5 
300 ml), dried over MgS0 4 , filtered, evaporated to 
dryness, and the residual oil was dried under high 

16 vacuum. After flash chromatography (Si0 2 , MeOH/DCM 0- 

1%) 14 was obtained as a viscous colorless oil 
(C 2< H 38 SjO„ 40 g, 76%). Rf ■ 0^49 (Si0 2 , EA) . 
»H NMR (CDClj) 5: 7.75 (d, 3 J(H,H) - 8.3 Hz, 4H, Ar) ; 
7.31 (d, 3 J(H,H) - 8.1 Hz, 4H, Ar) ; 4.10 (t, 3 J(H,H) = 

21 4.7 Hz, 4H, CH 2 OTs); 3.65-3.53 (m, 20H, CH 2 OCH 2 ) ; 2.40 

(s, 6H, CH 3 Ts) . 1J C NMR (CDCl,) 6: 144.7, 132.7, 
129.7, 127.8 (Ar); 77.0, 76.8, 71.2, 70.5, 70.4, 
70.3, 69.2, 68.5 (CH 2 0) ; 21.5 (CH 3 Ts) . FAB+MS 
(NBA/Nal) : expected exact mass for (M+Na')/z 613.1753, 

26 observed 613.1740. 



s V ni-hP*-is of 1 .77-his- (hydroxy) -5. 8,11.14. 17, 20, 2 3- 
hppl-aoxahept-ariorierane MSI (step »!' figure 3: 
com pound is an intermedi afe and not shown); 
31 Compound 15: Compound 23 (14 6.5 g, 1.63 Mol; Aldrich 

company) and Na (5.1 g, 0.22 Mol) were stirred at 
20°C under inert atmosphere then heated to 65°C until 
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l the complete, dissolution, of Na. The mixture was 

further stirred for 2 hours at 45°C then transferred 
through a canular to a solution of 14 (60.0 g, 0.102 
Mol) in THF (240 ml) . Because of the high viscosity 
of the 23/Na solution. it was kept and transferred at 
6 65'C, while the 14/THF solution was cooled to 0°C. 

After the transfer was completed (1 h), the 
temperature was kept at 0°C for 3 hours then 
overnight at 20°C. The reaction was quenched with a 
saturated solution of NH«C1 (250 ml), THF was 

11 evaporated and the remaining aqueous layer was 

extracted with DCM (3 5 250 ml), dried over MgSO,, 
filtered, and evaporated to dryness. The pure 
compound 15 (C 20 H 42 O„ 25.2 g, 58%) was obtained as a 
viscous colorless oil after flash chromatography 

16 (Si0 2 , MeOH/EA 0-10%). Rf - 0.17 (Si0 2 , MeOH/EA) . l H 

NMR (CDC1 3 ) 5: 3.61-3.52 (m, 28H, CH 2 OCH 2 ) ; 3.45 (t, 
3 J(H,H) = 6.0 Hz, 4H, CHjOH) ; 1.59 (m, 8H, . 
CH 2 CH2CH 2 OH) . 1J C NMR (CDC1 3 ) 5: 71.2, 71M, 70.4, 
70.3, 70.25, 70.2, 69.9 (CH 2 OCH 2 ) ; 62.2 (CH 2 0H) ; 29.8, 

21 26.4 (CH2CH2CH2OH) . FAB*MS (NBA/Nal) : expected exact 

mass for (M+Na 4 )/z 449.2727, observed 449.2738. 

Synthesis of 1 -phtal imido-27-hvdroxv- 
5.8.11.14.17.20.2^ -hPnfaoxahPPfadoderane (161 : 

26 




Compound 16: Compound 15 (23.6 g, 55.3 mMol), 
phtalamide (6.3 g, 42.5 mMol), PPh, (11.2 g, 42.5 




-59- 



mMol) and anhydrous THF (290 ml, distilled over Na) 
were stirred under inert atmosphere. DEAD diethyl - 
azodicarboxylate (7.4 g, 42.5 mMol) in anhydrous THF 
(50 ml) was added slowly using a syringe pump (8.4 , 
mi/h) at 20 °C. After the addition was complete the 
reaction mixture was stirred 1 at 20°C overnight. The 
solvent was evaporated to dryness and the residual 
solid was taken in boiling dH 2 0. After cooling down, 
the precipitate was filtered and the aqueous solution 
was evaporated to dryness under high vacuum. The oil 
thus obtained was chroma tographed (Si0 2 flash, a) DCM, 
b) MeOH/EA 0-10%) yielding 16 as a colorless viscous 
oil (C 2e H« 5 NO, 0 , 11.6 g, 95.2% based on recovered 15). 
Rf - 0.1 (Si0 2 , MeOH/EA 2%). *H NMR (CDC1 3 ) 5: 7.81 
(m, 2H, Ar); 7.70 (m, 2H, Ar) ; 3.69 (t, 3 J(H,H) = 6.9 
Hz, 2H, CH 2 N); 3.63-3.45 (m, 30H, CH 2 OCH 2 and CH 2 OH) ; 
1.74-1.60 (m, 8H, C&CHjCHjO)'. n C NMR (CDC1 3 ) 8: 168.4 
(CO); 133.9, 132.1, 123.1 (Ar) ; 71.3, 70.6, 70.5, 
70.4, 70.1, 70.0 (CH 2 OCH 2 ); 62.6 (CH 2 OH) ; 37.7 (CH 2 N) ; 
30.2, 26.9, 26.6, 25.3 (CH 2 CH 2 CH 2 0) . FAB+MS 
(NBA/Nal): 556 (M+H 4 )/z; expected exact mass for | 
(M+Na*)/z 578.2941, observed 578.2963. 

synthesis of l-Dhta1iTn1c1o-?7-f H.l'- 
hisTnPl-hnxv l-rit-vlloxvl-fi.S. 11.14,17,20.23- 

^pptanxa hPPtadodergne (17) (stPP y i' figure 3; 
r nmpound 17 is an i nt-Pi-medi afp and in thfrpfore, not 
shown) : 

Compound 17: Compound 16 (1.4 g, 2.57 mMol) and 
DMTrCl (1.74 g, 5.1 mMol) in pyridine (10 ml) were 
stirred at 20°C for 48 h. Et 3 N (1 ml) was added and 
the solvent was evaporated to dryness. The oily 
residue thus obtained was taken in DCM (20 ml) and 



• i extracted with dH 2 0 (2 5 20, ml). The organic layer was 

dried over Na 2 S0 4 , filtered, and evaporated to 
dryness, and dried under high vacuum. 17 (C 4 jH o 0 ?2 N, 
1.9 g, 86%). was. obtained as. a pale yellow oil after 
flash chromatography (Si0 2 , , EA/hexane. 50-75%). Rf » 

6 0.33 (Si0 2 , EA) . *H NMR (CDC1,) 6: 7.84 (m, 2H, 

phtalimide); 7.71 (m, 2H, phtalimide) ; 7.44 (d, 
3 J(H,H) - 7.2 Hz, 2H, Ph DMTr) ; 7.32 (d, 3 J(H,H) = 8.8 
Hz, 4H, p-MeOPh DMTr); 7.28 (t, 3 J(H,H) =7.7 Hz, 2H, 
Ph DMTr); 7.20 (t, 3 JW,H) - 7.1 Hz, 1H, Ph DMTr) ; 

H 6.82 (d, >J(H,H) - 8.8 Hz, 4H, p-MeOPh DMTr); 3.79 

(s, 6H, CH,0); 3.71 (t, 3 J(H,H) = 6.9 Hz, 2H, CH 2 N) ; 
3.64-3.42 (m, 28H, CH 2 OCH 2 ) ; 3.06 (t, >J(H,H) = 5.9 
Hz, 2H, CH 2 ODMTr); 1.80-1.59 (m, 8H, CH 2 CH 2 CH 2 0) . 13 C 
NMR (CDClj) 8: 168.5 (CO); 158.2, 145.3, 136.6, 130.0, 

16 128.2, 127.7, 126.5, 112.9 (DMTr Ar) ; 133.9, 132.1, 

• 123.2 (phtalimide Ar) ; 85.8 ( (p-MeOPh) 2 Ph£0) ; 71.3, 
70.6, 70.55, 70.2, 70.0 (CH 2 OCH 2 ) ; 63.0 (CH 2 ODMTr) ; 
55.2 (CHjO); 37.7 (CH 2 N) ; 26.9, 26.7, 26.6, 25.3 
(CH 2 CH 2 CH 2 0) . FAB+MS (NBA/CsI): expected exact mass 

21 for (M+Cs 4 )/z 990.3405, observed 990.3382. j 

■jr^i. of i- a T 11 i n ^-?->-f M^'-hisTnPthoxvl-ritvnoxvl- 
's.R.n.^.ii.^n.^B- ^ fr^ 3 ^^ 6 ^ 3 ^^^ 3 "^ (TV) figure 

3 step i: 

26 

0 

IV 

Compound (IV): Compound 17 (1.1 g, 2.3mMol), (NH 2 ) 2 
(0.4 ml, 12.8 AMol) and EtOH (40 ml) were refluxed 
for 2 h. After cooling down the precipitate was 
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11 



16 



21 



filtered and the solvent evaporated to dryness. The 
residual oil was taken in diethylether and the 
precipitate was filtered off. The organic phase was 
evaporated to dryness yielding IV, {CoH^NO^ 0.85 g, 
91%) which was used in the next step without further 
purification. 

*H NMR (CDC1 3 ) 8: 7.44 (d, 3 J(H,H) - 7.1 Hz, 2H, Ph 
DMTr); 7.32 (d, 3 J(H,H) » 8.9 Hz, 4H, p-MeOPh DMTr); 
7.28 (t, 3 J(H,H) - 7.4 Hz, 2H, Ph DMTr); 7.20 (t, 
3 J(H,H) = 7.1 Hz, 1H, Ph DMTr); 6.82 (d, 3 J(H,H) - 8.9 
Hz, 4H, p-MeOPh DMTr); 3.79 (s, 6H, CH,0) ; 3.65-3.40 
(m, 28H, CH 2 OCH 2 ); 3.05 (t, 3 J(H,H) - 6.0 Hz, 2H, 
CH 2 ODMTr ) ; 2.70 (t, 3 J(H,H) - 6.5 Hz, 2H, ClfcNHj) ; 1.7- 
1.45 (m, 8H, CH 2 CH 2 CH 2 0) . 13 C NMR (CDC1 3 ) 6: 158.2, 
145.3, 136.6, 13:0.0, 128.1, 127.7, 126.5, 112.9. (Ar); 
85.6 # ( (p-MeOPh) 2 Ph£0) ; 71.3, 71.2, 70.5, 70.1, 70.0 
(CH 2 OCH 2 ) ; 63.0 (CH 2 ODMTr) ; 55.2 (CH 3 0) ; 42.0 (CH 2 NH 2 ) ; 
30.4, 27.0; 26.6, 26.5 (CH 2 Cfl 2 CH 2 0) . FAB+MS 
(NBA/CsI): expected exact mass for (M+H 4 )/z 728.4374, 
observed 728.4351; 860 (M+Cs*)/z. 
t 

Svnl-hPsis of 27-phta1imirin-5.fi. 11. 14. 17. 20,23- 
hppl-aoxahepl-adoderanftir acid (V) figure 3 Step m: 




26 Confound (V): Compound 16 (8.6 g, 15.5 mMol) and 

pyridinium dichromate (29.2 g, 77.5 mMol) were 
stirred in DMF (145 ml) under inert atmosphere at 
20°C for 12 h. dH 2 0 (1500 ml) was added and the 
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1 reaction media was extracted with DCM (4 5 1500 ml). 

The organic layers were combined and evaporated to 
500 ml final volume, dried over MgS0 4 , filtered, and 
evaporated to dryness. The residue , was taken in 
diethylether (500 .ml) and filtered on celite to 
6 remove the insoluble materials. The oil obtained . 

after evaporation of the solvent was chromatographed 
(Si0 2 flash, MeOH/EA 0-10%) yielding pure V as a 
colorless viscous oil (C 28 H, 3 NO u , 7.2 g, 82%). Rf = 0.4 
(Si0 2/ MeOH/EA 10%). >H NMR (CDClj) 6: 7.82 (m, 2H, 
11 Ar); 7.70 (m, 2H, Ar) ; 3.67 (t, 3 J(H,H) » 6.9 Hz, 2H, 

CH 2 N); 3.62-3.44 (m, 28H, CH 2 OCH 2 ) ; 2.41 (t, 3 J(H,H) = 
7.2 Hz, 2H, C&COjH); 1.87 (m, 2H, CItCHjCOjH) ; 1.71 
(m, 2H, NCH 2 CH2); 1.61 (m, 2H, NCHjCHjCHj) . "C NMR 
(CDC1 3 ) 5: 176.8 (C0 2 H) ; 168.4 (CO); 133.9, 132.1, 
16 123.2 (Ar); 70.6, 70.5, 70.1 (CH 2 OCH 2 ) ; 37.7 (CH 2 N) ; 

31.0 (£H 2 C0 2 H); 26.9 (£H 2 CH 2 C0 2 H) ; 25.3, 24.8' 
(CHjCHjCHjN) . FAB+MS (NBA/Nal) : 570 (M+H*)/z; expected 
exact mass for (M+Na')/z 592.2734, observed 592.2748; 
614 (M-H*+2Na 4 )/z. 

21 

Synthesis of 27-Uino-S . B , 1 1 , 1 4 , 1 7 , 20 , 23- 
hpptaoxaheptadod^annir acid (18) figure 3, Step i : 



16 

26 Conpound 18: Compound V (2.7 g, 4.7 mMol),. (NH 2 ) 2 

hydrazine (1.5 ml, 48.0 mMol) and EtOH (120 ml) were 
refluxed for 3 h. After cooling down (1 h), the 
precipitate was filtered and the solvent evaporated 
to dryness. The residual oil was taken in 1 M HC1, 
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l filtered on paper, evaporated to dryness and dried 

under high vacuum. It was then taken in 
MeOH/diethylether 30%, filtered and evaporated to 
dryness' yielding IB iri: the hydrochloride salt form, 
{C 20 H« 2 n6 9 C1, 2.0 g, 89.4%) .'This compound can be 
6 further purified on a Dowex AG1X8 (OH") and eluted 

with 1 M HC1 if the starting material (V) was not 
carefully purified. *H NMR (D 2 0) 8: 3.82-3.72 (m, 
24H, OCH 2 CH 2 0); 3.70-3.66 (m, 4H, CH 2 CH 2 CH 2 0) ; 3.15 (t, 
3 J(H,H) - 6.6 Hz, 2H, CHjNHjCl) ; 2.58 (t, 3 J(H,H) - 7.4 

11 Hz, 2H, CHjCC^H) ; 2.03 (m, 2H, C&CHjCOjH) ; 1.85 (m, 

4H, CH 2 CH 2 CH 2 NHjCl) . l3 C NMR (D 2 0) 6: 184.2 (C0 2 H) ; 
72.1, 71.8, 71.5, 71.1 (CH 2 OCH 2 ) ; 41.2 (CH 2 NH 3 C1); 32.5 
(£H 2 C0 2 H) ; 27.6 (£H 2 CH 2 C0 2 H) ; 26.0, 25.7 
(CH 2 CH 2 CH 2 NH 3 C1) . FAB+MS (NBA) : expected exact mass 

16 for (M-CD/z 440.2860, observed 440.2850. 

Synthesis of i>7-Fmoc-ant1rio-5. 8. 11. 14. 17.20. 23- 
hPntaoxahep t adoriPranoir acid (VI) figure 3. Step n : 



6 

VI, X-Fmoc 

VII. X-Boc 

21 

The hydrochloride salt of 18 (1.0 g, 2.1 mMol) and 
10% Na 2 C0 3 (7.8 ml, 7.4 mMol) were stirred in dioxane 
(5.7 ml) and dH 2 0 (7.4 ml) at 0°C. FmocCl (0.6 g, 2.3 
mMol) in dioxane (5.7 ml) was added dropwise over 15 
26 min. The temperature was kept at +4°C for an 

additional 4 hours and 1 hour at 20°C. 0.01 M HC1 
(135 ml) was added, and the mixture was extracted 
with DCM (3 5 70 ml). The organic layers were dried 
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1 over MgSO«, filtered, and evaporated to dryness under 

reduced pressure. After flash chromatography (Si0 2 , 
EA) , VI (C 3S H s ,NO n , 1.2 g, 87%) was obtained as a pure 
, viscous colorless oil. Rf .•■ 0.2 (Si0 2 , EA) . *H NMR 
. (CDClj) 6:, 7.79 (d, 3 J(H,H) = ; 7.4 Hz, 2H, Ar) ; 7.63 

6 (d, 3 J(H,H) - 7.4 Hz, 2H, Ar) ; 7.42 (t, 3 J(H,H) - 7.3 

Hz, 2H, Ar); 7.34 (t, 3 J(H,H) = 7.4 Hz, 2H, Ar) ; 5.23 
(t, 3 J(H,H) - 5.3 Hz, 1H, OCONH); 4.43 (d, 3 J(H,H) = 
6.8 Hz, 2H, C&OCONH); 4.24 (t, 3 J(H,H) - 6.7 Hz, 1H, 
CHCH 2 OCONH); 3.70-3.45 (m, 28H, CH 2 OCH 2 ) ; 3.26 (m, 2H, 
11 OCONHCH 2 ); 2.43 (t, 3 J(H,H) •= 7.4 Hz, 2H, CH 2 C0 2 H) ; 

1.92 (m, 2H, CH 2 CH 2 C0 2 H) ; 1.63 (m, 4H, NCH 2 CH 2 CH.2) . ,3 C 
NMR (CDClj) 6: 173.9 (C0 2 H) ; 156.4 (OCONH); 143.9, 
141.2, 127.5, 127.0, 125.0, 119.8 (Ar) ; 70.8, 70.5, 
70.0, 66.2 (CH 2 OCH 2 ); 51 . 5 s (CHCHjOCONH) ; 47.2 
16 (CHCH 2 OCONH) ; 40.7 (OCONH£H 2 ) ; 30.6 (£H 2 C0 2 H) ; 26.7 

(£H 2 CH 2 C0 2 H) ; 26.6, 24.8 (6CONHCH 2 CH 2 CH?) . FAB+MS 
(NBA/CsI): expected exact mass for (M+Cs*)/z 794.2516, 
observed 794.2527. 



21 Synthesis of 27-Rnr-ami rio-5, 8. 11 . 14 . 17. 20. 23- 

heptaoxaheptarioriecanoic arid (VII) figure 3. step o: 



MO ^ ^ / O 

6 

VI, X « Frooc 

VII, X* Boc 



The hydrochloride salt of 18 (0.84 g, 1.76 mMol) and 
26 Et 3 N (1.0 ml, 7.3 mMol) were stirred in DMF (20 ml) 

under inert atmosphere at 0°C. Boc 2 0 (0.4 6 g, 2.1 
mMol) in DMF (10 ml) was added dropwise over 15 min. 
The temperature was raised to 20 C C and the mixture 
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21 



was stirred for an additional 3 h. dH 2 0 (4 ml) was 
added to destroy excess Boc 2 0 and the solvent was 
evaporated to dryness under high vacuum. The residual 
oil was taken in 0.01 M HC1 (25 ml) and extracted 
with ethylacetate (3 5 25 ml). The organic layers 
were combined and dried over MgS0 4 , filtered and 
evaporated to dryness. After flash chromatography 
(Si0 2 , EA), VII (C 25 H 49 NOn/ 0.8 g, 84%) was obtained as 
a pure viscous colorless oil. Rf * 0.26 (Si0 2 , 
MeOH/DCM 10%). *H NMR (CDCl,) 5: 3.70-3.40 (m, 28H, 
CH 2 OCH 2 ); 3.09 (m, 2H, C&NHCO) ; 2.42 (t, 3 J(H,H) - 7.1 
Hz, 2H, CH 2 C0 2 H); 1.88 (ro, 2H, CH 2 CH 2 C0 2 H) ; 1.55 (m, 4H, 
CONHCH^Cit) ; 1.41 (s, 9H, (CHj) 3 C) . 
"C NMR (CDC1 3 ) 6: 176.9 (C0 2 H) ; 156.0 (OCONH) ; 70.8, 
70.5, 70.4, 70.0, 69.9 (CH 2 OCH 2 and (CH 3 )j£); 40.2 
(OCONHCH 2 ); 30.8 (£H 2 C0 2 H) ; 28.3 (£H 3 ) 3 C); 26.7, 26.65 
(OCONHCH 2 £H 2 £H 2 ) ; 24 . 8' (£H 2 CH 2 C0 2 H) . FAB+MS (NBA/CsI): 
expected exact mass for (M+CsM/z 672.2360, observed 
672.2385; 804 (M-H*+2Cs*) /z. 

FAB+MS (NBA): expected exact mass for (M+H')/z 
540.3384, observed 540.3405. I 



26 



31 



syni-hgsis o f Fmor-Aia, j nf prmftri ia fp for fS8) as 
mnstratPd in figure 4: 

This compound was prepared according to a procedure 
as described in Atherton et al . Solid Phase Peptide 
Synthesis: A Practical Approach; Oxford University 
Press: Oxford, 1989, pp. 47-53. The typical 
procedure is as follows: H 2 NAla 2 OH (5.0 g, 31.2 mMol; 
commercially available from Sigma) and 10% Na 2 C0 3 (83 
ml, 78.0 mMol) were stirred in dioxane (55 ml) and 
dH 2 0 (57 ml) at 0°C. FmocCl (9-f luorenylmethyl 
chloroformate) (8.5 g, 32.8 mMol; Aldrich chemical 
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company) in dioxane (55 ml) was added drbpwise over 
30 min. The temperature was maintained at 0 °C for 1 
hour and at 20°C for 1 h. The precipitate that 
appeared during the course of the reaction was 
dissolved by adding dH 2 0 (360 ml) and dioxane (210 ml) 
followed by stirring at 20°C for 1 h. The reaction 
medium was cautiously acidified to pH 2-3, the 
precipitate formed was filtered, and the liquid phase 
was extracted with ethylacetate (3 5 250 ml) . The 
organic layers were evaporated to dryness under 
reduced pressure leaving a white solid which was 
combined with the precipitate obtained by filtration, 
and suspended in dH 2 0 (250 ml) . It was then filtered, 
washed with dH 2 0 (500 ml) and dried under high vacuum. 
The solid thus obtained was suspended in diethylether 
(250 ml) filtered, washed with diethylether (500 ml) 
and dried under high vacuum yielding FmocAla 2 
(C 2 ,H 22 N 2 0 5 , 9.3 g, 77.5%) as a white solid. M.p. = 
192°C. Rf = 0.35 (Si0 2/ chloroform/ MeOH/ CH 3 C0 2 H 
85/10/5). 'H NMR (DMF d 7 )6: 8.18 (d, 3 J(H,H) = 7.3 
Hz, 1H, OCONH); 7.94 (d, 3 J(H,H) = 7.5 Hz, 2H, Ar) 
7.77 (t, 3 J(H,H) = 7.5 Hz, 2H, Ar) ; 7.53 (d, 3 J (H,H) = 
7.8 Hz, 1H, CONH); 7.44 (t, >J(H,H) - 7.5 Hz, 2H, Ar) 
7.35 (t, 3 J(H,H) = 7.4 Hz, 2H, Ar) ; 4.45-4.20 (m, 5H, 
CHC&OCONH, CHCH 3 ); 1.37 (d, 3 J(H,H) = 7.2 Hz, 6H, 
CH 3 ) . 

"C NMR (DMF d 7 ) 6: 174.8 (C0 2 H) ; 173.2 (CONH); 156.7 
(OCONH); 145.0, 144.9, 141.8, 128.4, 127.8, 126.2, 
126.1, 120.8 (Ar); 67.0 <£H 2 OCONH) ; 51.0 (£HCH 2 OCONH) 
48.5, 47.7 (CHCH,); 18.7, 17.8 (CH 3 ) . FAB+MS (NBA): 
expected exact mass for (M+H')/z 383.1607, observed 
383.1618. 
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1 gynfhp.i^fi of Z-mAla,-mTvr(Ot-Bu)QMe (26) 

W»rm«rila n» for (SB) as illustrated in figure 4: 
Compond 26: Compound 24 (2.35 g, 8 mMol; 
, commercially available from Aldrich) , the 
hydrochloride salt of 25 (2.3 g, 8 mMol; commercially 
6 available from Aldrich) and EtjN (4.5 ml, 32 mMol) 

were stirred in DMF (80 ml) at- 0°C. HBTU (6.0 g, 16 
mMol) in DMF (80 ml) was added dropwise over 1 hour 
and the reaction mixture was stirred at 20°C for 3 
hours and stored at 0°C overnight. dH 2 0 (20 ml) was 
11 added to destroy excess HBTU, and the solvent was 

evaporated to dryness under high vacuum. The solid 
thus obtained was chromatographed (Si0 2 flash, EA/DCM 
0-50%) . 26 was obtained as a white solid (C 2e H 37 N 3 0 7 , 
3.75 g, 91.5%). M.p. = 167°C. Rf = 0.38 (Si0 2 , EA/DCM 
16 50%). *H NMR (CDC1 3 ) 5: 7.34 (m, 5H, Ph Z) ; 6.98 (d, 

3 J(H,H) = 9 Hz, 2H, Tyr Ar) ; 6.89 (d, 3 J(H,H) = 8.5 
Hz, 2H, Tyr Ar) ; 6.80 (d, 3 J(H,H) - 7.5 Hz, 1H, CONH); 
6.73 (d, 3 J(H,H) = 7.5 Hz, 1H, CONH); 5.54 (d, 3 J(H,H) 
- 7.1 Hz, 1H, OCONH); 5.10 (d, 3 J(H,H) - 3.0 Hz, 2H, 
21 CH 2 Ph Z); 4.78 (m, 1H, CHCH 2 Tyr); 4.49 (m, 1H, 

CHCH 3 ); 4.26 (m, 1H, CHCHj) ; 3.65 (s, 3H, C0 2 CH 3 ) ; 3.04 
(d, 3 J(H,H) = 6.1 Hz, 2H, CHCHj Tyr), 1.40-1.20 (m, . 
15H, (CHj)jC and CH 3 ) . "C NMR (CDC1 3 ) 5: 172.1, 171.7, 
171.5 (£0 2 CH 3 , CONH); 156.0 (OCONH); 154.5, 136.1, 
26 130.4, 129.6, 128.5, 128.2, 128.1, 124.2 (Ar) ; 78.4 

((CHjJjC); 67.0 (£H 2 OCONH); 53.4, 52.3, 48.9 
(NH£HRCO); 50.5 (C0 2 £H 3 ) ; 37.2 (CH£H 2 Tyr) ; 28.8 
((CH 3 ) 3 C); 18.7, 18.3 (£H 3 CH). 

FAB+MS (NBA/CsI): expected exact mass for (M+Cs*)/z 
31 660.1686, observed 660.1664. 
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1 common organic solvents. Pure compound 29- (C 2( H 4t N s 0 7 , 

2.7 g, quantitative yield) was obtained as a yellow 
foam from boiling CH 3 CN. M.p. = 198°C. (Rf - 0.12 
(Si0 2 , CH,CN) . l H NMR (DMF d,) 6: 8.29 (brd, 1H, CONH) ; 
8.20 (d, 3 J(H,H) - 7^.0'Hz, 1H, CONH) ; 8 15 (d, 3 J(H f H) 
6 = 7.6 Hz, 1H, CONH); 7.91 (d, 3 J(H,H) = 7.5 Hz, 1H, 

CONH); 7.20 (d, 3 J(H,H) - 8.5 Hz, 2H, Ar) ; 6.93 (d, 
S J(H,H) = 8.4 Hz, 2H, Ar); 4.57 (m, 1H, CHCH 2 Tyr); 
4.38 (m, 3H, CHCHj) ; 3.64 (s, 3H, C0 2 CH } ) ; 3.57 (m, 
1H, H 2 NCHCH3); 3.04 (m, 2H, CHCH 2 Tyr); 1.40-1.20 (m, 

11 21H, (CH,) 3 C and CH 3 ) . "C NMR (DMF d 7 ) 6: 175.8, 173.3, 

173.1, 172.7, 172.6 (CONH and £0 2 CH 3 ) ; 155.0, 132.6, 
130.5, 124.5 (Ar); 78.4 ( (CH 3 ) £) ; 54.8, 51.2, 49.6, 
49.5, 49.2 (NHCHRCO) ; 52.2 (C0 2 £H,) ; 37.2 <CH£H 2 Tyr); 
29.0 ((CH 3 ) 3 C); 21.1, 18.6, 18.4, 18.1 (£H 3 CH) . FAB+MS 

16 (NBA): expected exact mass" for (M+H*)/z 536.3084, 

observed 536.3070. 

Synthesis of R1 ntaratP- (T.) Ala , - ( T.l Tvr (Qt-B\l) QMe ( S8) 
as illustrated in figure 4: 
21 I 




Compound S8: Compound 29 (1 g, 1.9 mMol) and Et 3 N 
(1.0 ml, 7.6 mMol) were stirred at 0°C in DMF (100 
ml). Glutaric anhydride (0.23 g, 1.9 mMol) in DMF (50 
26 ml) was added dropwise over 45 min after which the 

temperature was raised to 20°C for 3 h, then the 
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reaction media was stored overnight at 4°C. The 
solvent was evaporated to dryness under high vacuum 
and the solid obtained was suspended in 0.1 M HC1 (50 
ml) and sonicated (10 inin) . The white precipitate was 
filtered and washed with 0.1 M HCl, dH 2 0, DMF, dH 2 0, 
and EtOH (100 ml each) and dried under high vacuum 
yielding 88 as a white powder (C 31 H« 7 N 5 O i0 / 0.96 g, 
78%). M.p. = 263°C. *H NMR (DMSO d s ) 6: 8.22 (d, 
3 J(H,H) « 7.4 Hz, 1H, CONH); 8.04 (d, 3 J(H,H) = 6.6 
Hz, 1H, CONH); 8.02 (d, 3 J(H,H) - 6.7 Hz, 1H, CONH); 
7.88 (d, 3 J(H,H) « 6.7 Hz, 1H, CONH); 7.86 (d, 3 J(H,H) 
-•7.1 Hz, 1H, CONH); 7.10 (d, 3 J(H,H) - 8.4 Hz, 2H, 
Ar); 6.86 (d, 3 J(H,H) - 8.4 Hz, 2H, Ar) ; 4.41 (m, 1H, 
CHCH 2 Tyr); 4.22 (m, 4H, CHCH,) ; 3.54 (s, 3H, C0 2 CH 3 ) ; 
2.92 (m, 2H, CHC& Tyr); 2.19 (t, 3 J(H,H) = 7.4 Hz, 
2H,CH 2 C0 2 H); 2.13 (t, >J(H,H) = 7.4 Hz, 2H,CH 2 C0NH); 
1.68 (m, 2H, H0 2 CCH 2 Ca 2 ) 125 (s, 9H, (CH 3 ) 3 C); 1.2.2- 
1.10 (m, 12H, CH 3 ). 13 C NMR (DMF d 7 ) 6: 174.9, 174.3,. 
173.7, 173.6, 173.0, 172.7, 172.65 (CONH, C0 2 H, 
£0 2 CH 3 ); 155.0, 132.7,-130.6, 124.5 (Ar) ; 78.6 
((CH 3 )j£); 54.8, 50.6, 50.4, 49.8, 49.4 (NH£HRC0) ; 
52.2 (C0 2 £H 3 ); 37.3 (CH£H 2 Tyr); 35.1, 33.8 
(H0 2 C£H 2 CH 2 £H 2 CONH) ; 29.0 ((£H 3 ) 3 C); 21.5, 18.2, 17.8,. 
18.7 (H0 2 CCH 2 CH 2 , CHCH 3 ) . FAB+MS (NBA/CsI) : expected 
exact mass for (M+Cs*)/z 782.2377, observed 782.2397; 
914 (M-H*+2Cs*)/z. 
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Early Embodiments of the Encoded Reaction cassettes 
The tutrix i 

In early attempts to implement this technology 
we encountered a variety of problems. First; certain 
5 matrix materials , like CPG (Controlled Pore Glass), 

were labile and liberated the polynucleotide- 
substrate hybrid leading to an undesired background 
reaction. We presumed that cleavage of a bond 
between the solid support and the first linker was 

10 responsible for this problem. In addition to the 

problem with its lability, CPG possesses free 
hydroxyl groups on which the polynucleotide chain can 
be grown during polynucleotide synthesis (25) , 
leading to unwanted labile bonds. A second 

15 difficulty was that enzymatic cleavage of the 

cassette was slow and incomplete (data not shown) 
because of steric hindrance of the substrate by the 
matrix and/or the polynucleotide. For this reason we 
had to extend the length of the linkers either 

20 between the solid support and the substrate and/or 

the substrate and the polynucleotide. For our 
purposes TentaGel was found to have much better 
mechanical and chemical properties. Furthermore, it 
possesses a long polyoxyethylene arm which attenuates 

25 the hindrance problem. Finally, the synthesis had to 

be simple so that the cassette can be easily prepared 
in a short time. All these considerations led to the 
strategy depicted in scheme 1. 



30 TentaGel is a tentacle copolymer of PEG 

(polyethylene glycol), and PS (polystyrene). It has 
been used successfully in solid phase peptide 
synthesis, e.g., B. G. de la Torre, B. G. et al. 
Tetrahedron Lett. (1994): vol. 35, pages 2733-2736; 

35 J , Haralambidis et al., Tetrahedron Lett. (1987): 



WO 96/22391 



^^^i&i^^^.,'^-;'-^' ^^^^^ 
-73- 



Matrix 
(TentaGel) 



Siibstratf 
(Solid Ptesc Peptide Synthesis) 



i . XT 



Linker 



TG - Substrate - linker ODMTr 



Solid PJwse 
Polynucleotide Synthesis 



TG - Substrate - Linker - 3' > Protected 45mer > 5' ODMTr 
Deprotection 



Cleavage in 
Organic Media 
(a-Chvmotiyp$in) 



TG - Substrate • Linker - 3 > 45mer > 5' 



Cleavage in 
Aqueous Media 
(a-Chymot rupsin ) 



HO. 



Devrotectwn 




n w JL.Linker - 3" > 45mer > 5 OH 



I) E I 



SCHEME 1 



WO 96/22391 



-74- 



Mm 



vol. 26, pages 5199-5202; and G. Barany, et al. in 
Peptides : Proceedings of the Twelfth American 
Peptide symposium (1992): Escon, Leiden, page 604. 
It has been used successfully in solid phase DNA 
5 synthesis, e;g. , H. Gao et Tetrahedron Lett. 

(1991): vol. 32, pages 5477-5480 and P. Wright et al. 
Tetrahedron Lett. (1993): vol. 34, pages 3373-3376. 
TentaGel has also been shown to be compatible with 
biocatalysts, e.g., L. , Meldal et al. J. Chen. Soc, 
10 Chem. Commun. (1994), p. 1849. It is stable to 

extremes of pH, and can be used in a variety of 
solvents. Its high swelling properties (4-7 times) 
in all usual solvents is an additional feature that 
makes it attractive for reactions involving 
15 biocatalysis. This polymer has the same mobility and 

dynamics as polyethylene glycol which has been used 
as a soluble support for oligonucleotide synthesis, 
e.g., E. Bayer, Angew. Chem., Intl. Ed. Engl. (1991): 
vol. 32, pages 5477-5480. It has a high diffusion 
20 coefficient, sorption, and mechanical stability. The 

reaction kinetics are of the same order as in 
solution because the functional groups are completely 
solvated (supra) . Since the polyethylene glycol part 
of the resin (70-80% w/w) , dominates its 
25 physico-chemical behavior the 

substrate-polynucleotide hybrid grown on it will be 
solubilized in organic or agueous solutions. Finally, 
this matrix is commercially available with different 
functionalities (OH, NH,, SH, C0,H, CHO, Br), which 
30 will facilitate the introduction of different types 

of substrates. 

Synthesis of the Substrate Portion of the Cassette: 

The catalyst chosen to study our cassette methodology 
35 had to be chemically and physically well defined. 



BNS0OCI0:«WO .M22301A1 I > 



.■V-*-.- ■ - ■ 





-75- 



a-Chymo trypsin seemed to be an ideal enzyme for this 
purpose. The substrate portion of the cassette was 
L-Ala 2 - L-Tyr - L-Ala, which is known to be the best 
substrate for a-chymotrypsin, e.g., W;K. Baumann et 
5 al. , Eur. J. Blochem. (1973): vol. 39, 381-391. The 

choice of this substrate was dictated by the fact 
that our goal in these initial studies was to explore 
the lower limit of the sensitivity of the system. 

10 It was not necessary to introduce a spacer 

between the solid support and the substrate since the 
TentaGel matrix is endowed with an extended 
polyethylene glycol arm bearing a terminal functional 
group that allows the direct attachment of the 

15 cassette substrate (Scheme 1) • The loading of the 

matrix is about 280 millimoles/gram. In order to 
avoid the generation of hindered sites, we 
f unctionalized only the most exposed ones (40-60 
millimoles/gram) by adding a large excess of matrix in 

20 the first step of the synthesis and capping the 

unreacted groups. 

1 

We anticipated that the spacing between the 
substrate and the DNA tag was a critical design 

25 feature of the cassette and, thus, four different 

linkers were prepared (Figure 15) . Linker L-I looses 
slowly the Fmoc (9-f luorenylmethoxycarbonyl) 
protecting group inducing the formation of a side 
product that is not practical to separate from the 

30 pure compound. Thus, we prepared the more stable 

derivative L-II which has a CBZ (benzyloxycarbonyl) 
protecting group. Unfortunately, the acetoxy group 
in both L-I and L-II was found to be labile under the 
strongly acidic conditions required for the 
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deprotection of the substrate (CF,CO t H/ethandithiol 
95%, 2 hours) . Accordingly, it is preferable to 
prepare a longer version of L-I, L-III which, does not 
have the labile acetoxy moiety. However, L-III did 
5 not couple efficiently with peptide on the resin. 

Instead, the free amino group (matrix-peptide) 
reacted irreversibly with the Fmoc of the linker, 
e.g., G. B. Fields et al. Int. J. Pep. Prot. Res. 
(1990)L vol. 35, 161-214. Accordingly, these results 

10 teach that L-IV is the preferred linkage agent. All 

data reported herein employs cassettes with this 
linkage agent. Since only the above three linkage 
agents have been tested to date, there is a 
possibility that other as yet untested linker 

15 appendages may prove superior. 



The peptide sequence selected as an exemplary 
substrate requires a deprotection step on the 
Tyrosine-O-t-Bu with CF 3 C0 2 H/ethandithiol, 95% for 2 

20 hours. Since this deprotection can not be performed 

after the polynucleotide synthesis (concentrated acid 
leads to many side reactions on the polynucleotide) , 
it was necessary to change the Tyrosine protecting 
group from a t-Bu to a base labile protecting group 

25 which can be removed at the same time as the 

deprotection of the DNA. Scheme 2 illustrates a 
pathway employed for protecting group exchange and 
for the introduction of the linker. Aside from its 
utility for the present experiment, this synthesis 

30 shows that the system can be readily transformed and 

modified chemically so as to be compatible with a 
variety of substrates. 
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Synthesis of the Polynucleotide Portion of the 
Cassette* The polynucleotide sequence is shown in 
Figure 16. It possesses two primer sequences and one 
encoding sequence which identifies the substrate 

which in this case is a pentapeptide where each amino 

* ■ * 

acid is arbitrarily assigned a triplet nucleotide 
sequence. Obviously, any nucleotide sequence may be 
used to encode the nature of the substrate and the 
choice of the nature of the code will depend 
primarily on the number and complexity of test 
substrates . 

Standard phosphoramidite methodology using CPG 
solid support, on a 3 94 Applied Biosystem DNA 
Synthesizer, was not efficient with the TentaGel 
matrix, e.g. M.J. Gait, Ed. (1990) Oligonucleotide 
Synthesis, a Practicable Approach (Oxford University 
Press, New York) . The yield per step dropped from 
"98% to "85%. After 45 steps, the overall yield with 
the CPG was "40% and only "0.07% with the TentaGel. 
Modification of the classical procedure was required. 
After many trials, three major modifications of the 
procedure (see materials and methods) were found to 
increase the yield per step from "85% to "97%, 
corresponding to an increase in the overall yield 
from "0.07% to "25%. The polynucleotide encoded 
peptide thus obtained was submitted to concentrated 
ammonia to deprotect the polynucleotide and the 
peptide, followed by 3% trichloroacetic acid in 
dichloromethane treatment to remove the 
dimethoxytrityl protecting group. 

Reaction Specificity. The cassette was submitted 
to enzyme cleavage and the results after the 
amplification by PCR of the liberated polynucleotide 



are illustrated in Figure 17. 

Lanes 1-8 show the results after incubation at 
20 *C for ,3 6: minutes with trypsin, pepsin, papain, 
carboxypeptidase A, proteinase K, a-chymotrypsin, 
a-chymotrypsin + Bowman-Birk' inhibitor, and no 
enzyme, e.g., Y. Birk, Y. Int. J. Peptide Protein 
Res. (1985): vol. 25, pages 113-131. Lane 9 
corresponds to the positive control and lane 10 to 
the negative control. 

The data in lane 6 show that in the presence of 
a-chymotrypsin a band corresponding to 45 nucleotides 
is present, indicating a net cleavage of the 
substrate by this enzyme. This interpretation is 
further supported by the control experiments. When 
the cassette is incubated with trypsin, pepsin, 
papain, or carboxypeptidase A, no band could be 
detected on the agarose gel which is in agreement 
with the specificity of these enzymes. When the 
Bowman-Birk inhibitor is added to a-chymotrypsin 
(lane 7), no cleavage is detected. As expected, ir! 
the presence of proteinase K a band is detected 
indicating a net cleavage by this enzyme. The 
intensity of the band indicates that the cleavage by 
proteinase K is weaker than that accomplished by 
a-chymotrypsin. This result is in agreement with the 
fact that a-chymotrypsin is specific for the 
substrate used in this study. In the absence of any 
enzyme (lane 8) no cleavage is detected after 30 
minutes . 

Under the reaction conditions used in this 
study, the enzymatic activity of 1 picomole of 
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ar-chymotrypsin was readily detected. One should be 
able to improve the sensitivity of detection since 
the substrate concentration (29.5 mM) used in our 
experiment was veil below saturation (32). 
Additionally, preliminary experiments have shown that 
a longer linker between the substrate and the 
polynucleotide enhance the accessibility of the 
enzyme to the substrate (data not shown) . 

An alternative to analyzing the PCR products on 
agarose gel, which can become laborious when 
libraries of catalysts are being screened, one can 
simply add to the reaction mixture a fluorescent 
probe that undergoes fluorescence enhancement upon 
intercalation into the DNA. The insert in figure 17 
shows a photograph taken under UV light (2 54 nm) of 
the reaction media in the presence of the YOYO-1 . 
The first well corresponds to the experiment in lane 
6, the second well to the experiment in lane 8, and 
the third to the probe in buffered solution without 
any additives. The greatest fluorescence enhancement 
is in the first well which contains the amplified | 
DNA. The second well shows a background fluorescence 
resulting from the interaction of the probe with the 
primers. As expected, the third well does not show 
any detectable fluorescence. Another advantage of 
the Y0Y0-1 probe is that the amount of the PCR 
product (which should be directly related to the 
efficiency of the enzyme cleavage) can be quantified, 
e.g., M. Ogura et al., Biotechniques (1994): vol* 16 , 
pages 1032-1033. 

Reaction Sensitivity. It was interesting to note 
that after 24 hours, in the absence of 
a-chymotrypsin, a band corresponding to the DNA 4 5mer 
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was detected (data not shown) . This background 
reaction can be due to bond solvolysis anywhere 
between the solid support and the first bases of the 
polynucleotide, or isinply to a leakage from the 
matrix. In an attempt to define the cleavage site(s) 
we prepared the same cassette lacking the substrate 
unit (where the polynucleotide Is directly linked to 
the matrix via a mixed phosphodiester bond) . When 
this matrix and the standard cassette are incubated 
separately without any catalyst, one can detect a 
background reaction after 16 hours. The ease of 
detection of this uncatalyzed reaction increases 
between 29 hours and 51 hours. 



If this uncatalyzed reaction originated from 
leakage from the matrix or solvolysis of a 
phosphodiester bond, the 45mer DNA would have been 
detected at the same time irrespective of whether the 
cassette contained a substrate unit. The fact that 
detectable cleavage after 16 hours is limited to 
cassettes containing the peptide substrate indicates 
that bond solvolysis occurs in the substrate 
sequence, most likely at a peptide bond. After 16 
hours when background cleavage of the peptide bond is 
observed, solvolysis of the phosphodiester bond is 
not detected. However, by 29 hours solvolysis of the 
phosphodiester bond is detected. 

Assuming that the rate constant for a peptide 
bond hydrolysis is "3 x 10** s' 1 (t l/2 " 7 years) , at a 
cassette concentration of 29.5 mM the velocity for 
peptide bond hydrolysis is '9 x 10" M/s, e.g., D. 
Kahn & W.C. Still, J. Am. Chem. Soc. (1988): vol. 
110, pages 7529-7534. After 15 hours one would 
expect to have "5 nanomoles of free polynucleotide in 
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solution. This amount is known to be readily 
detectable by the PCR. Since the rate constant for 



(1987): vol. 26, pages 1289-1300. 

Practicality of the Encoded cassette System. 

It herein disclosed that the cassette system works 
reproducibly, and that the entire cassette can be 
assembled in less than 48 hours using conventional 
synthetic chemistry. Because of the simplicity and 
versatility of our methodology, analysis of a large 
number of potential catalysts can be carried out in 
less than 4 hours. Although, the current limit of 
detection is about "1 picorooles, the sensitivity and 
efficiency of this system can be readily improved. 
These improvements may be achieved either by 
increasing the concentration of the substrate, and/or 
its loading on the solid support and/or introducing 
longer linkers between the substrate and the 
polynucleotide. 

The system is not limited to transformations in 
which bond cleavage or bond formation is the initial 
event. The only requirement is that the chemical 
transformation make a bond labile to other reagents. 
For example, in the search for dihydroxylation 
catalysts, an olefin can be used as a substrate 
because when it is dihydroxylated it can be 
selectively cleaved by periodate. Additionally, one 
can envision systems in which the transformation 
modifies the cassette such that it now becomes a 



phosphodiester bond hydrolysis is much slower (5.7 X 
10 M s 1 ) than that of peptide bond hydrolysis < the 




substrate for a known enzyme/ e.g., K. Morikava et 
al., J. Am. Chem. Soc. (1993): vol. 115, pages 
8463-8464. 

Finally, even using PCR conditions that are not 
yet optimized, it is herein demonstrated that one is 
able to detect in a matter of hours uncatalyzed 
chemical reactions with half -lives of years. This 
demonstrates that essentially any catalytic bond 
cleavage or formation event can in principle be 
readily detected in a very short time. The method 
should be applicable to detection of events that are 
of low efficiency either because the enzyme is poor 
or, more importantly, because the catalyst is only 
one member of a large library and is, thus, present 
in low concentration. 

MATERIALS AND METHODS 

The chemicals were purchased from Novabiochem 
and Aldrich for peptide synthesis, from Millipore for 
DNA synthesis, and from Promega for the PCR 
experiments. The Y0Y0-1 probe was purchased from 
Molecular Probes Inc. The solvents were purchased 
from Fisher or Baxter (water content < 0.001%). For 
synthesis of the linkers, the chemicals were 
purchased from Aldrich, and were used without any 
further purification. 

Substrate Synthesis. TentaGel is commercially 
available from Novabiochem or Rapp Polymere 
(Germany) . The peptide was assembled according to 
standard Fmoc methodology, e.g., A. Aherton & R.C. 
Sheppard, Solid Phase Peptide Synthesis : A 
Practicable Approach (1989): Oxford University Press. 
In a typical procedure, 3 equivalents of the coupling 



reagent for aside bond formation, 

2- (lH-Benzotriazole-l-yl) -1,1,3,3 tetramethyluronium 
Hexafluoro-phosphate (HBTU) , 3 equivalents of 
N-Hydroxybenzotriazole (HOBt) , 6 equivalents of 
•N,N-diisopropyl-ethylanine\. (DIEA),, and 3 equivalents 
of the N-a(9-f luorenylmethoxycabonyl) -amino acid 
(Fmoc-aa) are added in dimethylacetamide to the resin 
swollen in dichloromethane (DCH) . The coupling was 
completed in less than an hour as judged by the 
Kaiser test, e.g., E. Kaiser et al. Analyt. Biocham. 
(1970): vol. 34, page 595 and V. K. Sarin et al. 
Analyt. Biochem. (1981): vol. 117, 147. After each 
step the resin was washed with N , N-dimethy f ormamide , 
methanol, and DCM. The Fmoc protecting group was 
removed upon treatment with 20% piper idine in 
N, N-dimethy If ormamide (2 X 10 minutes). The yield of 
each step was determined by the titration of the Fmoc 
group from a small sample, e.g., A. Atherton, A. & 
R.C. Sheppard, R. C, Solid Phase Peptide Synthesis 
: A Practicable Approach (1989): Oxford University 
Press, p. 107. To have a final loading of 40-60 
millimoles/gram, the first step of the synthesis was 
performed with four fold excess of solid support to 
Fmoc-aa. After all the final washings, the unreacted 
amino groups were capped (0.25 volume of acetic 
anhydride 4.23 M in 2 , 6-lutidine; 0.75 volume of 
N, N-dimethy larainopyr idine, 0.53 M in THF, 2 X 10 
minutes) . The overall yield for peptide synthesis 
was "98%. 

The t-Butyl protecting group for the hydroxyl 
moiety on tyrosine was removed by treatment with 
CF 3 C0 2 H/ethandithiol 95% for 2 hours, followed by 
extensive washing with DCM, methanol, and 
N, N-dimethy If ormamide. The Fmoc protecting group was 
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removed before coupling to the linker (see scheme 
II) . The matrix containing Tyros ine-o-L-IV was 
converted to Tyros ine-O-acetyl after selective 
deprotectipn of the phenolic ring (concentrated NH 4 OH, 
3 hours) ''and'^capping -*'(bV25 'volume of "acetic 
anhydride, 4 ;23 M in 2,6-lutidine, 0.75 volume of 
N,N-dimethylaminopyridine, 0.53 M in THF, 30 min) . 
The yield after each step was determined by the 
dimethoxytrityl cation assay, e.g., M. J. Gait, 
Oligonucleotide Synthesis : A Practicable Approach 
(1990): Oxford University Press, p. 48. 

Linker L-IV was prepared in one step from the 
sodium salt of 4-hydroxybutyrate and dimethoxytrityl 
chloride in pyridine, e.g., H. Schaller et al., J . 
Am. Chera. Soc. (1963): vol. 85, pages 3821-3827. 

t 

DNA Synthesis. DNA synthesis was carried out on 
a 394 Applied Biosystem DNA synthesizer. The 
standard 1 millimole cycle was modified as follows : 
1) All washing steps 3, 59, 61, 66, 77, and 94 were 
prolonged to 30 seconds. The use of longer or 
shorter times decreased the yield. 2) The incubation 
time with phosphoramidite and tetrazole (step 45) was 
prolonged from 25 seconds to 120 seconds. 3) The 
concentration of the phosphoramidites was increased 
from 0.1 M to 0.2 M. The bases were deprotected upon 
treatment with concentrated NH 4 OH for 20 hours at 
55 °C. The dimethoxytrityl group is removed upon 
treatment with 3% trichloroacetic acid in DCM (5 
minutes) , followed by extensive washing with DCM, 
tetrahydrofuran, methanol, tris-HCl buffer (20 
millimolar, pH 8, NaCl 160 millimolar) , and dH 2 0 
(deionized water) . After this step, the cassette is 
ready for use. 
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under UV light (254 nm) . 

Uneatalysed Reactions. The cassette lacking the 
substrate unit was, prepared as follows: TentaGel 
bearing a hydroxy 1 group functionality (1 g) was 
shaken with dimethoxytrityl chloride (10 eq, 85 mg) 
in pyridine (4 ml) at room temperature for 3 days. 
Titration of the dimethoxytrityl group showed a 
loading of 32 mil limole/ gram. The unreacted hydroxy 1 
groups were acetylated (0.25 volume of acetic 
anhydride, 4.23 M in 2,6-lutidine, 0.75 volume of 
N,N-dimethylaminopyridine, 0.53 M in THF; 30 
minutes) . DNA synthesis was performed on this matrix 
following the procedure described above. 

The cassette lacking the substrate (0.5 mg, 12.2 
millimoles/gram) and the cassette with the substrate 
unit (1 mg, 6.2 millimoles/gram) were suspended 
separately in 20 ml of tris-HCl buffer (20 
millimolar, pH 8, NaCl 160 millimolar) and 180 
milliliter of dH 2 0. The mixtures were shaken at 20 °C 
and aliquots (18.7 ml) taken after 2 hours, 16 hours, 
29 hours, and 51 hours were subjected to the PCR. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Janda, Kim 0 

Fenniri, Hicham 
Lerner, Richard A 



(ii) TITLE OF INVENTION: ENCODED REACTION CASSETTE 



(iii) NUMBER OF SEQUENCES t 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: The Scripps Research Institute 

(B) STREET: 10666 North Torrey Pines Road, TPC-8 

(C) CITY: La Jolla 

(D) STATE: California 
<E) COUNTRY: US 

(F) ZIP: 92037 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk | 

(B) COMPUTER: IBM PC compatible 

<C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US/96 

(B) FILING DATE: 18-JAN-1996 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/374,050 

(B) FILING DATE: 18-JAN-1995 
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(viii) ATTORNEY / AGENT INFORMATION I 

(A) NAMEs Levia, Donald G 

(B) REGISTRATION NUMBERS 28,636 

(C) REFERENCE/DOCKET NUMBERS 445 ♦ IPC 



(ix) TELECOMMUNICATION INFORMATIONS 

(A) TELEPHONE: (619) 554-2937 

(B) TELEFAX: <619) 554-6312 



(2) INFORMATION FOR SEQ ID NOsls 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTHS 4 amino acida 

(B) TYPEs amino acid 

(C) STRAND ED NESS s eingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NOsls 

Gly Gly Phe Gly 

1 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acida 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION t SEQ ID NO: 2: 

Alt Ala Phe Ala Ala 

1 5 



?dYUS9o/ou&*a 



(2) INFORMATION FOR SEQ ID NO: 3 J 



r. 



(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS t single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
Ala Ala Tyr Ala Ala 

I 1 5 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 




BNSDOCID: <WO .9GS2391A1 .!_> 



(ix) FEATURE: 
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<A) NAME/XEYi Modif iad-sita 
(B) LOCATION t 5 

(D) OTHER INFORMATION s /labal- Xaa 

/not#- "Xaa if Alanina-OlutaratsT 



(xi) SEQUENCE DESCRIPTION t SEQ ID NOs4t 

Ala Ala Tyr Ala Xaa 

1 5 



(2) INFORMATION FOR SEQ ID NOlSi 



(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTHS 9 amino acids 

(B) _TYPEi amino acid 

(C) STRANDEDNESS s aingla 

(D) TOPOLOGY s linaar 

(ii) MOLECULE TYPE! DNA (ganomic) 



(xi) SEQUENCE DESCRIPTION* SEQ ID NOi5i 

Ala Ala Ala Ala Tyr Ala Ala Ala Ala 

1 5 

(2) INFORMATION FOR SEQ ID NOs6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LSNGTHt 5 amino acids 

(B) TYPEs amino acid 

(C) STRANDEDNESS s sing la 

(D) TOPOLOGY s linaar 
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(ii) MOLECULE TYPE* peptide 



(ix) FEATURE l 

(A) NAME/ KEY i Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION! /label- Xaa 

/note- *Xaa is Methylated Tyrosine-o-t-Butyl* 



(ix) FEATURE t 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION! /label- Xaa 

/note- "Xaa is Alanine-Glutarate" 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

Xaa Ala Ala Ala Xaa 

1 5 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

'•. : \ ( " (A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

■'"•'V," . (D) OTHER INFORMATION: /label- Xaa 

■■■Of J if* 
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/not*- "Xaa ie Tyroaine-o-t-Butyl* 



(xi) SEQUENCE DESCRIPTION t SEQ ID NO: 7: 



Ala Ala Xaa Ala Ala 



1 



5 



(2) INFORMATION FOR SEQ ID NOtBt 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH t 45 base pair a 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:0: 
GATTCTTATC CCGCGCTGAT CGTCCTCGAG GOAACCCTTC ATCGA | 



(2) INFORMATION FOR SEQ ID NOt9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



45 
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(xi) SEQUENCE DESCRIPTION! SEQ ID NOi9t 

AGCTACTTCC CAAGGGAGCT GCTGCTACTC CGCCCCTATT CTTAO 
45 

(2) INFORMATION FOR SEQ ID NO J 10 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 bass pain 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 
<D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CTAAGAATAG GGCCCGACTA GCAGCAGCTC CCTTGGGAAG TAGCT 
45 

(2) INFORMATION FOR SEQ ID NOill: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 

' ' V/'f v. 




BNS0OCID:<WO ©C2Z»1A1_L> 




GGCGGCTTCC CAAGGCAGCT OCTOCTXOTC TATTCTTAOG GOCCC 
45 

(2) INPORMATlbN FOR SEQ ID NOtl2t 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTHS 45 base pairs 

(B) TYPE j nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY i linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GGCGGCTTCC CAAGGGAGCT GCTGCTAGTC TAGGCGTAGG GGCCC 
45 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) ~ TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GGCGACGTGA TGGGCAATTT GATGATAGAC TAGGCGGAGG CGAGG 
45 



9C?2301A1 t > 




(2) INFORMATION FOR SSQ ID NOtl4t 



(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH i 45 base pairs 

( B ) TYPE i ' nucleic acid 

(C) STRANDEDNESSt ling la 

(D) TOPOLOGY: linitr 

(ii) MOLECULE TYPEs DNA (ganotnic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CGCAGGGTGA GTAAGGCAGC ATACGCAGCA ATAGTGGACG GAGCG 
45 



i 



i 
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What is claimed is: 

1. An encoded reaction cassette for assaying a 
cleavage reaction .comprising: * 

a solid phase matrix; 

a substrate covalently linked to said solid 
phase matrix, said substrate being susceptable to 
cleavage by means of the cleavage reaction; and 

a first polynucleotide linked to said substrate, 
said first polynucleotide including a first PCR 
primer sequence, an encoding sequence, and a second 
PCR primer sequence, said encoding sequence 
positioned between said first and second PCR 
sequences . 

2. An encoded reaction cassette as described in 
Claim 1 further comprising: 

a first linker covalently linking said solid 
phase matrix to said substrate and 

a second linker covalently linking said 
substrate to said first polynucleotide. 

3. An encoded reaction cassette as described in 
Claim 1 wherein: 

said substrate is a polypeptide. 

4. An encoded reaction cassette as described in 
Claim 1 further comprising: 

a first linker covalently linking said solid 
phase matrix to said substrate, 

a second linker covalently linking said 
substrate to said first polynucleotide, and 

said substrate is a polypeptide. 
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5. An encoded reaction cassette as described in 
Claim l further comprising: 

a second polynucleotide linked to said 
substrate; said second polynucleotide including the 
first PCR primer sequence, the encoding sequence, and 
the second PCR primer sequence. 

6. An encoded reaction cassette as described in 
Claim 2 further comprising: 

a third linker covalently linking said substrate 
to said second polynucleotide. 

7. An encoded reaction cassette as described in 
Claim 1 wherein: said substrate is a polypeptide 
having a susceptibility to proteolytic cleavage by a 
protease and said encoding sequence encodes the 
polypeptide. 

8. An admixture of cleavage products from an 
encoded reaction cassette comprising: 

a solid phase cleavage product and 

a soluble phase cleavage product forming an 
admixture with said solid phase cleavage product, 

said solid phase cleavage product including a . 
solid phase matrix and a first cleavage product of a 
substrate covalently linked to a solid phase matrix, 

said soluble phase cleavage product including a 
second cleavage product of the substrate covalently 
linked to a first polynucleotide, said first 
polynucleotide including a first PCR primer sequence, 
an encoding sequence, and a second PCR primer 
sequence, said encoding sequence separating said 
first and second PCR sequences. 
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9. An admixture of cleavage products from an 
encoded reaction cassette as described in Claim 8 
further comprising: 

said solid phase cleavage product including a 
first linker covalehtly linking said solid phase 
matrix to said first cleavage product and 

said soluble phase cleavage product including a 
second linker covalently linking said second cleavage 
product to said first polynucleotide. 

10. An admixture of cleavage products from an 
encoded reaction cassette as described in Claim 8 
wherein: 

said first and second cleavage products are 
partial polypeptides. 

11. An admixture of cleavage products from an 
encoded reaction cassette as described in Claim 8 
further comprising: 

said solid phase cleavage product including a 
first linker covalently linking said solid phase 
matrix to said first cleavage product, I 

said soluble phase cleavage product including a 
second linker covalently linking said second cleavage 
product to said first polynucleotide, and 

said first and second cleavage products are 
fragments of a polypeptide. 

12. An admixture of cleavage products from an 
encoded reaction cassette as described in Claim 8 
further comprising: 

a second polynucleotide linked to saidsecond 
cleavage product of the substrate, said second 
polynucleotide including the first PCR primer 



mm 



sequence, the encoding sequence, end the second PCR 
primer sequence. 
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13. An admixture of cleavage products from an 
encoded reaction cassette as described in Claim 8. 
further comprising: 

a third'iinker cbvalehtly linking "said second 
cleavage product to said second polynucleotide. 

14. A method for detecting a cleavage agent within a 
sample, the method comprising the following steps: 

Step A: admixing the sample with an encoded 
reaction cassette for producing a mixture of cleavage 
products, the mixture of cleavage products 
potentially including: 

a solid phase cleavage product including a 
solid phase matrix and a first cleavage product 
of a substrate covalently linked to a solid 
phase matrix; 

a soluble phase cleavage product including 
a second cleavage product of the substrate 
covalently linked to a first polynucleotide, 
said first polynucleotide including a first PCR 
primer | sequence, an encoding sequence, and a 
second PCR primer sequence, said encoding 
sequence separating said first and second PCR 
sequences ; and 

uncleaved encoded reaction cassette; then 
Step B: separating and isolating the soluble 
phase cleavage product from the solid phase cleavage 
product and uncleaved encoded reaction cassette; then 

Step C: amplifying the encoding sequence of the 
polynucleotide of the soluble phase cleavage product 
separated and isolated in said Step B by means of 
PCR; and then 

Step D: detecting the encoding sequence 

amplified in said Step C. 
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15. A method for detecting a cleavage agent within, a 
sample as described in said Claim 14 comprising the 
following further step after said Step D: 

Step E: correlating the detection of amplified 
encoding sequence of said Step D with the presence of 
the cleaving agent. 
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20, A method for detecting a cleavage agent within a 
sample as described in said Claim 18 wherein the 
cleavage agent is a protease and: 

in said Step A: the substrate included within 
the encoded reaction cassette is a polypeptide 
susceptible to cleavage by the protease. 
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21. An admixture o£ unligated reactants for 
producing an encoded ligation cassette for assaying a 
ligation reaction, the admixture comprising: 

a solid phase ligation component and 

a soluble phase ligation component forming an 
admixture with said solid phase ligation component, 

said solid phase ligation component including a 
solid phase matrix and a first ligation reactant 
covalently linked to the solid phase matrix, 

said soluble phase ligation component include a 
second ligation reactant covalently linked to a first 
polynucleotide, said first polynucleotide including a 
first PCR primer sequence, an encoding sequence, and 
a second PCR primer sequence, said encoding sequence 
position between said first and second PCR sequences, 

said first and second ligation reactants being 
ligatible for joining the solid phase ligation 
component to the soluble phase ligation component and 
forming an encoded ligation cassette. 

22. An admixture of ligation components as described 
in Claim 21 further comprising: 

said solid phase ligation component including a 
first linker covalently linking said solid phase 
matrix to said first ligation reactant and 

said soluble phase ligation component including 
a second linker covalently linking said second 
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ligation reactant to said first polynucleotide. 

23. An admixture of ligation components as described 
in Claim 21 wherein: 

said first and second ligation reactants are 
fragments of a ligatible oligonucleotide. 

24. An admixture of ligation components as described 
in Claim 21 further comprising: 

said solid phase ligation component including a 
first linker covalently linking said solid phase 
matrix to said first ligation reactant, 

said soluble phase ligation component including 
a second linker covalently linking said second 
ligation reactant to said first polynucleotide, and 

said first and second ligation reactants are 
fragments of a ligatible oligonucleotide. 

25. An admixture of ligation components as described 
in Claim 21 further comprising: 

a second polynucleotide linked to said ligation 
product, said second polynucleotide including the 
first PCR primer sequence, the encoding sequence, and 
the second PCR primer sequence. 



26. An admixture of ligation components as described 
in Claim 21 further comprising: 

a third linker covalently linking said second 
ligation reactant to said second polynucleotide. 



27. An encoded ligation cassette for assaying a 
ligation reaction comprising: 
a solid phase matrix; 

a ligation product covalently linked to said 
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solid phase matrix; and 
J;: a first polynucleotide linked to said ligation 
^ - product, said first polynucleotide including a first 

PCR primer sequence, an encoding, sequence, and a 
second PCR primer sequence, said encoding sequence 
' separating said first and second PCR sequences. 

28. An encoded ligation cassette as described in 
Claim 27 further comprising: 

:'■) a first linker covalently linking said solid 

phase matrix to said ligation product and 

a second linker covalently linking said ligation 
product to said first polynucleotide. 

29. An encoded reaction cassette as described in 
Claim 27 wherein: 

said ligation product is an oligonucleotide. 
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30. An encoded ligation cassette as described in 
Claim 27 further comprising: 

a first linker covalently, linking said solid 
, phase matrix to said ligation product, 

a second linker covalently linking said ligation 
product to said first polynucleotide, and 

said ligation product is an oligonucleotide. 

31. An encoded ligation cassette as described in 
Claim 27 further comprising: 

a second polynucleotide linked to said ligation 
product, said second polynucleotide including the 
first PGR primer sequence, the encoding sequence, and 
the second PCR primer sequence. 

32. An encoded ligation cassette as described in 
Claim 27 further comprising: 

a third linker covalently linking said ligation 
product to said second polynucleotide. 

33. An encoded ligation cassette as described in. 
Claim 27 wherein: said ligation product is a 
polynucleotide having a susceptibility to ligation by 
a ligase and said encoding sequence encodes the 
oligonucleotide . 

34. An encoded ligation cassette produced by 
incubating a ligating agent with an admixture of 
ligation components, 

said admixture of unligated reactants including: 
a solid phase ligation component and 
a soluble phase ligation component forming 

an admixture with said solid phase ligation 

component , 
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said solid phase ligation component , 
including a solid phase matrix and a first 
ligation reactant covalently linked to the solid 



phase matrix, < 

said soluble phase ligation component 
including a second ligation reactant covalently 
linked to a first polynucleotide, said first 
polynucleotide including a first PGR primer 
sequence, an encoding sequence, and a second PCR 
primer sequence, said encoding sequence position 
between said first and second PCR sequences, 
said first and second ligation reactants being 

ligatible for joining the solid phase ligation 

component to the soluble phase ligation component and 

forming an encoded ligation cassette, 

said ligating agent having a ligation activity 

with respect to said first and second ligation 

reactants, 

35. An encoded ligation cassette as described in 
claim 34 wherein: 

said first and second ligation reactants are 
ligatible oligonucleotides; and 

said ligation agent is a nucleotide ligase. 

36. A method for detecting a ligating agent within a 
sample, the method comprising the following steps: 

Step A: admixing the sample with an admixture of 
ligation components for producing an encoded ligation 
cassette, the admixture of ligation components 
including: 

a solid phase ligation component including 
a solid phase matrix and a first ligation 
reactant covalently linked to the solid phase 

matrix; and 
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a soluble phase ligation component 
including a second ligation reactant covalently 
linked to a first polynucleotide, said first 
polynucleotide including a first PCR primer 
sequence, an encoding sequence, and a second PCR 
primer sequence, said encoding sequence 
positioned between said first and second PCR 
sequences; then 

Step B: separating and isolating the encoded 
ligation cassette formed in said Step A together with 
unligated portions of the solid phase ligation 
component from the unligated portion of the soluble 
phase ligation component; then 

Step C: amplifying the encoding sequence of the 
polynucleotide of the encoded ligation cassette 
separated and isolated in said Step B by means of 
PCR; and then 

Step D: detecting the encoding sequence 
amplified in said Step C. 

37. A method for detecting a ligation agent within a 
sample as described in said Claim 36 comprising the 
following further step after said Step D: 

Step E: correlating the detection of amplified 
encoding sequence of said Step D with the presence of 
the ligation agent. 

38. A method for detecting a ligation agent within a 
sample as described in said Claim 36 wherein the 
ligation agent is a ligase and: 

in said Step A: the ligation product included 
within the encoded ligation cassette is a 
polynucleotide susceptible to ligation by the ligase. 
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Sctssite bond 
P3' P2' II P1 P2 P3 



HO 



2'JJ P1 F 



Vr.r 



HO' 



HO 



HO 



O " ' O 
HO' 



51 = Gly r (L)Phe-G»y 

52 » Gty 2 -(D)Phe-Gly 

53 = (L)Ala 2 -(L)Phe-(L)Ala 2 



54 o (L)Ala 2 -(L)Tyr-(L)Ata 2 

55 r {D)Aia r (D)Tyr.(D)AJa 2 

56 = (L)Ala r (l)Tyr-(L)Ala 2 -Glutarate 

57 = (L)Ala 4 -(L)Tyr-(L)Ala 4 

S6 = (L)MeOTyr(Ot-Bu)-(L)Ala 4 -Glutarate 



51 = SEQUENCE 

52 = SEQUENCE 
S3 1= SEQUENCE 

54 = SEQUENCE 

55 b SEQUENCE 

56 = SEQUENCE 

57 = SEQUENCE 

58 - SEQUENCE 



ID NO: 1 
ID NO: 1 
ID NO: 2 
ID NO: 3 
ID NO: 3 
ID NO: 4 
ID NO: 5 
ID NO: 6 



Figure 4 
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NovaSyn KD^s^ 



Standard SPPS using 
spacer VI (or VII), Fmoc-Ala 2 
Fmoc-Tyr(OSi*-BuMe2), 
and spacer I 



NSKD^n 



Attachement of spacer VIII 
followed by DNA synthesis using 
0-cy anoethyl phosphoramidites . 



Ready for ONA synthesis 



CN 



1- Coned NH 3 , 1 h, 55°C 
2* 1 M TBAF/THF, 15 min 
3- Extensive washings 



NSKD^^ K 



<5> 



45 Na 



5'-Polynticleotide-3* 



a-Chymotrypsin cleavage 



HO, 




Ready for use in bond 
cleavage detection 



Na w 



''"S'-Polynudeotide-S" 
45 Na® 



PCR on solution 



Figure 5 
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QPQ^s^Flmt nucleotide 



1- DNA synthesis using methyl phosphoramldites 

2- Attachment of spacers VIII and IX (or X) 



' CPG 5-Protecled Polynucleotide^ ^-^s^o vX^o* p * 



1- AcOH 60% 

2- Fmoc-A!a 2 , HBTU, HOBt, DIEA 



0 o 

CPG^v^S'-Protected Polynucleotide^ 1 ^ >! p 
o 0 . H Na € © 




1- Demethylation 

2- Coned NH 3 , 55°C, 16 h 

3- Liophilization 

4- Ethanol precipitation 



1- Demethytation 

2- Coned NH 3 , 55°C, 16 h 

3- Liophilization 

4- Ethanol precipitation 



5^Polynucleotide-3V 0 -|t 0 ^o^. 0 .p, 



o. j> 
o 



45 Ha 



v 



$10 



Used for the detection of 
chemically catalyzed bond 
formation 



45 Na® © 

5*- Polynucleotide-^^, p. Q 



I M \ 0*00 \ U 



S9 



00 

Na 0 

Used for the detection of 
enzyme catalyzed bond 
formation 



Figure 6 
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1- HBTU, Et 3 N 

2- H 2 , Pd/C 
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27 

1- 24, HBTU, Et 3 N 

2- H 2 , Pd/C 

3- Glutaric anhydride, Et 3 N 



o o 



0 .Vt\v o 

Polystyrene 



66 



1- S8, HBTU. HOBt, DIEA 

2- TFA/DCM (1/4) 



Matrix- 1st spacers' 



1- S8, HBTU, HOBt, DIEA 

2- TFA/DCM (1/4) 



West side of the reaction cassette 
Ready for use tn the detection of 
enzyme catalyzed bond formation 



C34 (Matrix = NSKD) 
C35 (Matrix « TentaGel) 



S9, a-Chymotrypsin 



MatrixOst «pattHB- K <p^ 

°# e -° / \ 9 o 
3'-Polynucleotide-5^ 0 .P- 0 -K^.oi^ 0 .P- @ 

© V /5 0 © 

45Na Na 



PGR on solid support 



Figure 7 
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Polystyrene 



-66 



NH| 



20% vN Gtutaratdehyde 
0.05 M phosphate butler, pH 8.5 



Porystyrene xX ^ 0 ^o N ^ N *^x\A 0 036 



Polystyrene^^ 



S10, 60% DMSO/0.1 M borate buffer pH 10 



o / \ © 



s 0 



1- 0.1 M phosphate buffer, pH 8.5. 0.2 M NaBH 3 CN 

2- Extensive washing 



e 



Na 



0 
45Na 



Polystyrene,, 



\ /I66 M 



o. .o 

*0 



£o^°^o-^°;?;°^Po^ 
Na *5Na 



© 



PCR on solid support 



Figure 8 
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Template A 
Template 6 
Template C 
Template D 
Template E 
Template F 
Template G 
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Primer I 



Encoding sequence 
A 



Primer 2 



/ — \/ — : \/ n 

5'- AGO TAC TTC CCA AGGGAG CTG CTG CTA GTC GGG CCC TAT TCT TAG- 3' 

5'- CTA AGA ATA GGG CCC GAC TAG CAG CAG CTC CCT TGG GAA GTA GCT- 3' 

5*. GGC GGC TTC CCA AGG GAG CTG CTG CTA GTC TAT TCT TAG GGG CCO 3* 

5 - GGC GGC TTC CCA AGG GAG CTG CTG CTA GTC TAG GCG TAG GGG CCC- 3* 

5'- GGC GAC GTG ATG GGC AAT TTG ATG ATA GAC TAG GCG GAG GCG AGG- 3* 

5'- CGC AGG GTG AGT AAG GCA GCA TAC GCA GCA ATA GTG GAC GGA GCG- 3' 

5'- AGC TAC TTC CCA AGG GAG CTC CTG CTA GTC GGG CCC TAT TCT TAG • 3 



Template A - 
Template B : 
Template C 
Template D 
Template E : 
Template F : 
Template G 



> SEQUENCE ID NO: 9 

: SEQUENCE ID NO: 10 
: SEQUENCE ID NO: 11 
: SEQUENCE ID NO: 12 
: SEQUENCE ID NO: 13 
= SEQUENCE ID NO: 14 

> SEQUENCE ID NO: 8 



Figure 9 
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1st Spacers 



r 



2nd Spacers 



Potynucteotide-3' 



Matrix 



Y 

Substrate 



~v — 
Tag 



Figure 10 
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Matrix 

Substrate 

Cassette 

a-Chymotrypsin 

Lane 



45mer^_ 
Primers dimers— 
Primers'" 



TG-NH2 



NSKD-NH 2 



S7 C ontro ls 
C32 1 ' 




Figure 12 
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